

1 


GAACCCAGTT 




51 


TGGCATCTCC 




101 


TTTTAAGCTA 




151 


TTCTAATCAT 




201 


GGAGACAATA 




251 


AATGACAAAT 




301 


TGGATGAACC 




351 


GATTGGGTGC 




401 


CAGCAAAAAG 




451 


CGTGGTCAGG 




501 


CTGGCCGGAT 




551 


GGGCATTTGC 




601 


GATCTAATGA 




651 


ACATGGGCAG 




701 


CATGAACTAC 




751 


CAGTTTCCCT 




801 


CCAGAGATTA 




851 


AAAATGGACT 




901 


CAGGTTTGAG 




951 


TGCGGAAATA 


'a IT 


1001 


TAAAAAAAGG 




1051 


CTTTTGGTGC 


Co 


1101 


TATTATTTTC 


C3 


1151 


GGCTGCATTT 




1201 


TCCTTTTTTA 




1251 


CCTTTTATTC 


s = 


1301 


TAGGGCAAAT 


' '"^ 


1351 


AGTATCCCGT 




1401 


GCCTTCCCTA 




1451 


GCTTTTTACA 




1501 


GAAATGACAT 


^ =^ 


1551 


GCAGGCATTG 


1601 


ATTTAAAATC 


H 


1651 


GCTTGTTCAT 




1701 


GGGATTGCGG 




1751 


TAAGGAGTGG 




1801 


CCATGGTTGG 




1851 


TCCCTGGTGG 




1901 


TCCCCTTATG 




1951 


GCAGGGAAGG 




2001 


TTGGATGCAA 




2051 


GAGACCTCGA 




2101 


TGACAGTGGA 




2151 


TTTCCTGTCA 




2201 


CAGAAGAGAC 




2251 


GTATCGTTGG 




2301 


CCAGCAGAAA 




2351 


CCCTTTTACA 




2401 


TCCGAAAGCT 




2451 


CTTGGCATTA 




2501 


AAACCAAGAC 




2551 


GAGAGAAGAA 




2601 


AACCTGAGGG 




2651 


GAAATATAAA 




2701 


GAATGGAGGA 




2751 


GGTATTTCCC 




2801 


CTGGATGCAT 




2851 


ACAGAGATGA 




2901 


GCAAAGACAC 




2951 


GCCATCTATA 




3001 


CTGTTTAATT 




3051 


AAGAGCCTTT 




3101 


AGCTGTAACA 



GCTTCAGCGA GTCGAACTAC 
CTTGCTTGCT GCAGCAGGGA 
GCAAGCTTTT TCTTTTTCTT 
GGATGCTTCT TCCGACCCTT 
TTCCCCTGAG GGAATTACAT 
GGAGGCAGCA TTAACAGTTC 
AATTCCAGGT GTTGGTACAT 
GAGAAAAATG TAAAGACAGA 
AAAGAATCAG CATGGGAAAT 
ATGGCTAGTA GTAACACTAA 
TAATAGACAT TGCTGCCGAT 
CTTAGTGCGT TGTGGTACAA 
AACAACATTT GAAGAGAGGG 
AATTAATCAT AGGTCAAGCA 
ATAATGTACA TCTTCTGGGC 
GGTAAAGGTA TTTGCTCCAT 
AAACTATTTT AAGTGGATTC 
TTAATGATTA AAACCATCAC 
TTTAGGAAAA GAAGGTCCCC 
TCTTTTCCTA CCTCTTTCCA 
GAGGTGCTAT CAGCTGCCTC 
ACCAATTGGA GGAGTTCTTT 
CTCTCAAAAC TTTATGGAGA 
GTTTTGAGGT CCATCAATCC 
TGTGGAGTAT CATACACCAT 
TTCTAGGGGT ATTTGGAGGG 
ATTGCCTGGT GTCGTCGACG 
TCTGGAAGTC ATTATTGTTG 
ATCCATACAC TAGGCTAAAC 
GACTGTGGTC CCCTGGAATC 
GAATGCCAGT AAAATTGTCG 
GAGTATATTC AGCTATATGG 
ATAATGACAG TATTCACTTT 
CCCCAGCATG GCCATTGGAG 
TGGAGCAGCT TGCCTACTAT 
TGTGAGGTCG GGGCTGATTG 
TGCTGCTGCA TGCTTAGGTG 
TTATTGTTTT TGAGCTTACT 
GCTGCAGTCA TGACCAGTAA 
CATTTATGAA GCACACATCC 
AAGAAGAATT CACTCATACC 
AGGAATGATC CTCCCTTAGC 
TGATATAGAA AACATGATTA 
TAATGTCAAA AGAATCTCAG 
CTGACAATTG CAATAGAAAG 
CAGTTCTCGG GTGTGTTTTG 
GTCCTCGGCC ATTGAAGCTT 
GTGACAGACC ACACCCCAAT 
GGGACTGAGG CAGTGCCTTG 
TAACAAAAAA AGATATCCTC 
CCCGCTTCAA TAATGTTCAA 
ACGGAAGAGG AAGTTTATTT 
AGTCATCTAC TTTTTTTTCC 
AGCCGGGTTT TTGCAACATG 
GTTGTTTGGG GAGGGTVAAGG 
GTCTAACAGA AAGCAGCGTA 
TCAGCTGAGG ATGTGCCTGA 
CAGCAGAGTC CTCGAGCACC 
ATTATCAGTC CCTATTTCTA 
AAACTGCAAG GTCTTGCCCT 
CATGAATTGT ATAGTTAAGC 
ATTTCTCTCT CTCTCTCTCT 
AAGCCTCTTT AAATCGGTGT 




AGTTTTAACC TCATCAAATA 
TGGAAGAAAT GTCACTTTCT 
TTTCTTCTTC TATTTAAAAA 
ATTTGCCTTA TGACGGGGGA 
AAAAGAGGT^ CTCATTATAC 
TACACATTTA CTGGATCTTT 
ATGATGATTT CCATACTATT 
GAAAGGCATA GACGGATCAA 
GACAAAAAGT TTGTATGATG 
CAGGATTGGC ATCAGGGGCA 
TGGATGACTG ACCTAAAGGA 
CCACGAACAG TGCTGTTGGG 
ATAAATGTCC ACAGTGGAAA 
GAGGGTCCTG GTTCTTATAT 
CTTGAGTTTT GCCTTTCTTG 
ATGCCTGTGG CTCTGGAATT 
ATCATCAGAG GTTACTTGGG 
ATTAGTCCTG GCTGTGGCAT 
TGGTACATGT TGCCTGTTGC 
AAGTATAGCA CAAACGAAGC 
AGCTGCAGGG GTTTCTGTAG 
TTAGCCTGGA AGAGGTTAGC 
TCATTTTTTG CTGCTTTAGT 
ATTTGGTAAC AGCCGTCTGG 
GGTACCTTTT TGAACTGTTT 
CTTTGGGGAG CCTTTTTCAT 
CAAGTCCACG AAATTTGGAA 
CAGCCATTAC TGCTGTGATA 
ACCAGTGAAC TGATCAAAGA 
CTCTTCTCTT TGTGACTACA 
ATGACATTCC TGATCGTCCA 
CAGTTATGCC TGGCACTCAT 
TGGCATCAAG GTTCCATCAG 
CGATCGCAGG AAGGATTGTG 
CACCACGACT GGTTTATCTT 
CATTACACCT GGCCTTTATG 
GTGTGACAAG AATGACTGTC 
GGAGGCTTGG AATATATTGT 
ATGGGTTGGA GATGCCTTTG 
GATTAAATGG ATACCCTTTC 
ACCCTGGCTG CTGACGTTAT 
TGTCCTGACA CAGGACAATA 
ATGAAACCAG CTACAATGGA 
AGATTAGTGG GATTTGCCCT 
TGCCAGGAAA AAACAAGAAG 
CACAGCACAC CCCATCTCTT 
CGAAGCATTC TTGACATGAG 
GGAGATTGTG GTGGATATTT 
TAACTCACAA TGGGCGCCTC 
CGGCATATGG CCCAGACGGC 
CTGAATCTCA CAGATGAGGA 
GTTGAATAGC ACAACTCTTT 
TCCTTTACAA AAAAAGAAAG 
GTTTGCTUU^T AATGCTGGTG 
AGAGAGAAGG AAAGGAGTGA 
TCAACTCCTA TTGTTCTGCA 
TAGTGCAGGC TTGCGCCTCA 
TGGCCTGTTG CTCCAACATT 
GAGGGATTAC TTTGAATTGA 
TTTTTTTAAT CAAAACTGTT 
ATTACCTTTC TACATTCCAG 
CTCTCTCTCT CTCTCTACTG 
ATCCTTTTGA AGCAGTCCTT 
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3151 TCTCATATTG AGATGTACTG TGATTTTACT GAGGTTTCAT CACAAGAAGG 
3201 GAGTGTTTCT TGTGCCATTA ACCATGTAGT TTGTACCATC ACTAAATGCT 
3251 TGGAACAGTA CACATGCACC ACAACAAAGG CTCATCAAAC AGGTAAAGTC 
3301 TCGAAGGAAG CGAGAACGAA ATCTCTCATT GTGTGCCGTG TGGCTCAAAA 
3351 CCGAAAACAA TGAAGCTTGG TTTTAAAGGA TAAAGTTTTC TTTTTTGTTT 
3401 TCCTCTCAGA CTTTATGGAT AATGTGACCG GGTCTTATGC AAATTTTCTA 
3451 TTTCTAAAAC TACTACTATG ATATACAAGT GCTGTTGAGC ATAATTAAAT 
3501 AAAATGCTGC TGCTTTGACA GTAAAGAGAA AAAAAAAAAA AAAAAAAAAA 
3551 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
3601 AAAAAAAAAA AAAAAAAAAA AAAAA (SEQ ID N0:1) 

FEATURES: 

5'UTR: 1-158 

Start Codon: 159 

Stop Codon: 2532 

3*UTR: 2535 




HOMOLOGOUS PROTEINS: 

Top BLAST Hits: 



CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 



1800000510 
1800000510 
1800000522 
1800000498 
1800000523 
1800000498 
1800000497 
1000685681 
1800000498 
1800000522 



9762 

9763 

7216 

9660 

1972 

9700 

8791 

515 / 

9661 

6296 



/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
altid=gi | 
/altid=gi 
/altid=gi 



12599548 /def =gb I AAB951 61 . 1 1 (AF029. 

12599550 /def=gb|AAB95162.1| (AF029. 

I 47 62023 /def=gb|AAD294 40.1|AF14277. 

14502869 /def=ref |NP_001820.1 I chlo. 

18134363 /def=sp|Q9R27 9|CLC3_CAVPO . 

16680948 /def=ref |NP_031737.1| chlo. 

11705905 /def=sp| P517 92 |CLC3_RAT CH. 
6634696 /def =einb I CAA71072 . 2 | (Y0994. 

1705903 /def=sp|P517 90|CLC3_HUMAN . 

4753144 /def=gb|AAB88634.2 I (U8346. 



EST: 



gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 



10993825 

10934924 

10952244 

12383593 

6591096 

10251711 

2321385 

5594360 

5422132 

10327969 



/dataset 
/dataset 
/dataset 
/dataset 
/dataset=* 
/dataset 
/dataset 
/dataset 
/dataset 
/dataset 



=dbest 
=dbest 
=dbest 
=dbest 
dbest 
=dbest 
dbest 
dbest 
dbest 
dbest 



/taxon=96 . . 

/taxon=96 . . 

/taxon=96. . 

/taxon=96 . . 
/taxon=9606 

/taxon=96. . 
/taxon=9606 
/taxon=9606 
/taxon=9606 

/taxon=96 . . 



Score 
1575 
1573 
1572 
1570 
1561 
1560 
1560 
1559 
1558 
1556 



Score 
1562 
1336 
1251 
1205 
1170 
1104 
1045 
975 
965 
963 



E 

0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 



E 

0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

gil 10993825 Neuronal precursor cells-teratocarcinoma 

gi 110934924 Whole embryo-mainly head 

gil 10952244 Neuronal precursor cells-teratocarcinoma 

gil 12383593 Small intestine-duodenal adenocarcinoma 

gil 6591096 Lung-small cell carcinoma 

gi 1 10251711 Breast-normal 

gi I 2321385 Schwannoma tumor 

gil 5594360 Brain-tumor 

gil 5422132 Testis 

gi 1 10327969 Lung-large cell carcinoma 
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1 MDASSDPYLP YDGGGDNIPL RELHKRGTHY TMTNGGSINS STHLLDLLDE 

51 PIPGVGTYDD FHTIDWVREK CKDRERHRRI NSKKKESAWE MTKSLYDAWS 

101 GWLWTLTGL ASGALAGLID lAADWMTDLK EGICLSALWY NHEQCCWGSN 

151 ETTFEERDKC PQWKTWAELI IGQAEGPGSY IMNYIMYIFW ALSFAFLAVS 

201 LVKVFAPYAC GSGIPEIKTI LSGFIIRGYL GKWTLMIKTI TLVLAVASGL 

251 SLGKEGPLVH VACCCGNIFS YLFPKYSTNE AKKREVLSAA SAAGVSVAFG 

301 APIGGVLFSL EEVSYYFPLK TLWRSFFAAL VAAFVLRSIN PFGNSRLVLF 

351 YVEYHTPWYL FELFPFILLG VFGGLWGAFF IRANIAWCRR RKSTKFGKYP 

401 VLEVIIVAAI TAVIAFPNPY TRLNTSELIK ELFTDCGPLE SSSLCDYRND 

451 MNASKIVDDI PDRPAGIGVY SAIWQLCLAL IFKIIMTVFT FGIKVPSGLF 

501 IPSMAIGAIA GRIVGIAVEQ LAYYHHDWFI FKEWCEVGAD CITPGLYAMV 

551 GAAACLGGVT RMTVSLWIV FELTGGLEYI VPLMAAVMTS KWVGDAFGRE 

601 GIYEAHIRLN GYPFLDAKEE FTHTTLAADV MRPRRNDPPL AVLTQDNMTV 

651 DDIENMINET SYNGFPVIMS KESQRLVGFA LRRDLTIAIE SARKKQEGIV 

701 GSSRVCFAQH TPSLPAESPR PLKLRSILDM SPFTVTDHTP MEIWDIFRK 

751 LGLRQCLVTH NGRLLGIITK KDILRHMAQT ANQDPASIMF N (SEQ ID NO: 2) 



FEATURES : 

Functional domains and key regions : 

[1] PDOCOOOOl PSOOOOl ASN_GLYCOSYLATION 
N-glycosylation site 



Number of matches: 5 

1 90-93 NETT 

2 364-367 NTSE 

3 392-395 NASK 

4 587-590 NMTV 

5 598-601 NETS 



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



Number of matches: 3 

1 24-27 KKES 

2 330-333 RRKS 

3 331-334 RKST 



[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 8 



1 


22 


-24 


SKK 


2 


333- 


335 


STK 


3 


529- 


531 


TSK 


4 


613- 


615 


SQR 


5 


631- 


633 


SAR 


6 


642- 


644 


SSR 


7 


658- 


660 


SPR 


8 


709- 


711 


TKK 
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[4] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 



Niomber of 


matches : 


13 


1 


27-30 


SAWE 


2 


34-37 


SLYD 


3 


92-95 


TTFE 


4 


93-96 


TFEE 


5 


105-108 


TWAE 


6 


217-220 


STNE 


7 


249-252 


SLEE 


8 


383-386 


SLCD 


9 


589-592 


TVDD 


10 


666-669 


SILD 


11 


674-677 


TVTD 


12 


679-682 


TPME 


13 


709-712 


TKKD 



[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

18 

GLASGA 
GALAGL 
GICLSA 
GSNETT 
GLSLGK 
GNIFSY 
GVSVAF 
GAPIGG 
GVLFSL 
GVFGGL 
GGLWGA 
GLWGAF 
GVYSAI 
GAIAGR 
GAAACL 
GIYEAH 
GIVGSS 
GLRQCL 



Number of 


matches : 


1 


49-54 


2 


53-58 


3 


72-77 


4 


88-93 


5 


189-194 


6 


206-211 


7 


234-239 


8 


240-245 


9 


245-250 


10 


310-315 


11 


313-318 


12 


314-319 


13 


408-413 


14 


447-452 


15 


491-496 


16 


541-546 


17 


638-643 


18 


692-697 



Membrane spanning structure and domains: 



lix 


Begin 


End 


Score 


Certainty 


1 


99 


119 


1, 


.810 


Certain 


2 


182 


202 


2. 


.131 


Certain 


3 


233 


253 


1. 


.398 


Certain 


4 


256 


276 


1, 


.019 


Certain 


5 


290 


310 


1. 


.770 


Certain 


6 


321 


341 


0, 


.797 


Putative 


7 


361 


381 


2, 


.093 


Certain 


8 


400 


420 


1. 


.539 


Certain 


9 


473 


493 


1. 


.739 


Certain 


10 


496 


516 


1. 


.218 


Certain 


11 


540 


560 


1. 


.568 


Certain 


12 


570 


590 


0. 


.975 


Putative 
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BLAST Alignment to Top Hit: 

>CRA| 18000005109762 /altid=gi| 2599548 /def =gb | AAB95161 . 1 | (AF029346) 
chloride channel protein 3 [Homo sapiens] /org=Homo 
sapiens /taxon^9606 /dataset=nraa /length=818 
Length = 818 

Score = 1572 bits (4026), Expect =0.0 

Identities = 764/765 (99%), Positives = 764/765 (99%) 

Query: 27 GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 86 

GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 
Sbjct : 54 GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 113 

Query: 87 SAWEMTKSLYDAWSGWLWTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 14 6 

SAWEMTKSLYDAWSGWLWTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 
Sbjct: 114 SAWEMTKSLYDAWSGWLWTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 173 

Query: 147 WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 206 

WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 
Sbjct: 174 WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 233 

Query: 207 PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 266 

PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 
Sbjct: 234 PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 293 

Query: 267 NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 326 

NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 
Sbjct: 294 NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 353 

Query: 327 FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 386 

FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 
Sbjct: 354 FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 413 

Query: 387 WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 446 

WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 
Sbjct: 414 WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 473 

Query: 447 YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 506 

YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 
Sbjct: 474 YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 533 

Query: 507 GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 566 

GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 
Sbjct: 534 GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 593 

Query: 567 WIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKEEFTHTTL 626 

VVIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKEEFTHTTL 
Sbjct: 594 WIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKEEFTHTTL 653 

Query: 627 AADVMRPLRNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRDLT 686 

AADVMRP RNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRDLT 
Sbjct: 654 AADVMRPRRNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRDLT 713 

Query: 687 lAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIWD 746 

lAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIWD 
Sbjct: 714 lAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIWD 773 

Query: 747 IFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQTANQDPASIMFN 791 

IFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQT7VNQDPASIMFN 
Sbjct: 774 IFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQTANQDPASIMFN 818 (SEQ ID NO: 4) 
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>CRA| 18000004989660 /altid=gi | 4502869 /def=ref I NP_001820 . 1 | chloride 
channel 3; ClC-3 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset-nraa /length=820 
Length = 820 

Score = 1567 bits (4013), Expect = 0.0 

Identities = 764/767 (99%), Positives = 764/767 (99%), Gaps = 2/767 (0%) 

Query: 27 GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 86 

GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 
Sbjct: 54 GTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHRRINSKKKE 113 

Query: 87 SAWEMTKSLYDAWSGWLWTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 14 6 

SAWEMTKSLYDAWSGWLWTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 
Sbjct: 114 SAWEMTKSLYDAWSGWLVVTLTGLASGALAGLIDIAADWMTDLKEGICLSALWYNHEQCC 173 

Query: 147 WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 206 

WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 
Sbjct: 174 WGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFAFLAVSLVKVFA 233 

Query: 207 PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 266 

PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 
Sbjct: 234 PYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKEGPLVHVACCCG 293 

Query: 267 NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 326 

NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 
Sbjct: 294 NIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSF 353 

Query: 327 FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 386 

FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 
Sbjct: 354 FAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGLWGAFFIRANIA 413 

Query: 387 WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 446 

WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 
Sbjct: 414 WCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTDCGPLESSSLCD 473 

Query: 447 YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 506 

YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 
Sbjct: 474 YRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKVPSGLFIPSMAI 533 

Query: 507 GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 566 

GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 
Sbjct: 534 GAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAACLGGVTRMTVSL 593 

Query: 567 WIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKE — EFTHT 624 

WIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKE EFTHT 
Sbjct: 594 WIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFLDAKEEFEFTHT 653 

Query: 625 TLAADVMRPLRNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRD 684 

TLAADVMRP RNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRD 
Sbjct: 654 TLAADVMRPRRNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQRLVGFALRRD 713 

Query: 685 LTIAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIV 744 

LTIAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIV 
Sbjct: 714 LTIAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTVTDHTPMEIV 773 

Query: 745 VDIFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQTANQDPASIMFN 791 

VDIFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQTANQDPASIMFN 
Sbjct: 774 VDIFRKLGLRQCLVTHNGRLLGIITKKDILRHMAQTANQDPASIMFN 820 (SEQ ID NO: 5) 



FIGURE 2, page 4 of 5 



>CRA| 1000685681515 /altid=gi | 6634 696 /def-emb I CAA7 1072 . 2 I {Y09941) 
putative chloride channel ClC-3 [Xenopus laevis] 
/org=Xenopus laevis /taxon=8355 /dataset=nraa 
/length=7 91 
Length =7 91 

Score = 1559 bits (3993), Expect =0.0 

Identities = 745/791 (94%), Positives = 771/791 (97%) 

Query: 1 MDASSDPYLPYDGGGDNIPLRELHKRGTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDD 60 

MD SSDPYLPYDGGGDNIPLR+LHKRGTHYT+TNGG+INS+THLLDLLDEPIPGVGTYDD 
Sb jet : 1 MDISSDPYLPYDGGGDNIPLRDLHKRGTHYTVTNGGAINSTTHLLDLLDEPIPGVGTYDD 60 
(SEQ ID NO: 6) 



Hmmer search results (Pfam) : 

Model Description Score E-value N 

CE00039 CE00039 chloride_channel 1671.9 0 

CE00420 E00420 CLC 1288.1 0 

PF00654 Voltage gated chloride channels 1172.4 0 

PF00571 CBS domain 78.1 7e-20 



Parsed for domains: 
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1 


AATTCTATAC 




51 


TGTTTATCTT 




101 


GAAGCATTGT 




151 


TGCTGACTGT 




201 


CAAGTTAAGA 




251 


CATTTTTTAT 




301 


CCTTTATAAA 




351 


AAACATATAA 




401 


TACAGTTAGC 




451 


AGTGTGCAGG 




501 


ATACAGAAAA 




551 


TAGCTAGAAT 




601 


TATATTCCTT 




651 


AAGAAGCTCT 




701 


ATACAGTTTT 




751 


ATTGAAGAGA 




801 


CCTTCTCTCA 




851 


TTAGAGATTG 




901 


TTGAGTTCTG 




951 


TTGTTTAGTG 




1001 


CTAGAGGAAT 




1051 


AATAAGCATT 




1101 


AGAAAGTTGC 


f 1 


1151 


ATTTCTGAAT 




1201 


GACACATTTT 




1251 


AGGAAGCTAG 


^ t 


1301 


GCGTAGTTTA 




1351 


TTCTTTCTCC 




1401 


AGGAAACAAG 




1451 


TCAACTCTTC 


f 1 


1501 


TTATAGAGTA 




1551 


GAGAAACCTT 


% J 


1601 


ATTACTGTAT 


A. Jt. 
f — 


1651 


AGAGTGCTGC 


ly 


1701 


GCATTCAAAA 




1751 


TTTATCTTAA 




1801 


CCACTGATGC 




1851 


AGTTTTGTGT 




1901 


CTGATTAAGA 




1951 


AGCAACCAGG 




2001 


GAACCCAGTT 




2051 


TGGCATCTCC 




2101 


TTTTAAGCTA 




2151 


TTCTAATCAT 




2201 


GGAGACAATA 




2251 


CTTGCTGTGA 




2301 


CTAGCTTAAA 




2351 


TAGTTGCCCT 




2401 


GTGTAGCCAT 




2451 


TCACGTCGGG 




2501 


CTGAAAATGA 




2551 


GCATCTAAAA 




2601 


AATTTACACA 




2651 


GCCAATTACC 




2701 


TTTACCTACA 




2751 


CTATATTAAT 




2801 


CATTGTCTTC 




2851 


GACTAAATGT 




2901 


CCAGCATGCC 




2951 


TTAATTGCAT 




3001 


ATTCGAAAAA 




3051 


TTCAAGAJW^ 




3101 


TTTGAATTAC 



AAATATAATT ATATAGATAT 
TAAAAATAAT TCAAATATGG 
CAGTATTTAT TTTACCGGGA 
CATTTTCATT CTTTATGATC 
GAGTGCCTAC AAATACCGGG 
GTAAAGACAT ATAAGAACAT 
TGCCATACAA TTATATATTT 
AAGAACCACA CACTCCCAAA 
CTTTGTCAAA GCTTTCCTTG 
AAGGAGCATA GGAGAAAi^AA 
TAAAGAAAAA AATCACCTAC 
CAGGATAAGA TAGAATATTC 
TCCTGGAACC CTGTCTCCCA 
GTTTATCTGA GATTTAAAAT 
TTAAAAAGAC CCTAAATAAG 
ATTTCTCTCT TCTCTTTACT 
CCTCCAATTT TCATGTAAAC 
TCTCATTCAA TACTTTAGGA 
TGGAATTGCT TCTAATGAAC 
ACACTGTGTA ACAGGCATTT 
CCTCTCAGAG GTAGCTATAA 
GTGCGGTTTT TGTTTTTGTT 
TTTATTTCTG AACTTCATAA 
TTTATAACAA TTTCTAGAAC 
TGCTAAAAGT TAAAAACACA 
TAGAGATAGG AAAGTCCTCT 
GACACATACA ATAGTAAAGT 
TCAAAAGCTA CAATGTCTTA 
AAGCCGCTGG AGGAGATTGG 
AGTTATTCGG TTTTTAAATC 
TTTTTTTTTT ATTTTTAATA 
TGGCATATTG TAGTTTTTTT 
AAAACTAGTA CATTCTCTCG 
TCTCATTTTT CTACATTAGA 
TTTCCTTTTT ACATCTCTGT 
AACCAAAAGA AATAATGTTT 
TACCGAAGCT GTATTGTGAG 
GTTGTACTTG GAGCTTAGTC 
AGGCTGTGCC GCCTCTAAGC 
CTGCAATAAT TTCCCTTTGA 
GCTTCAGCGA GTCGAACTAC 
CTTGCTTGCT GCAGCAGGGA 
GCAAGCTTTT TCTTTTTCTT 
GGATGCTTCT TCCGACCCTT 
TTCCCCTGAG GGAATTACAT 
ATTCTCTGTT GGTATGTTTT 
CTGGTTCTCG TTTGTCCTTT 
GGGTTGTTAG TAAGGGGAAA 
CCCTGATTGT TTTCTCTGTG 
CTCCGACTCC ATCTTCTGCA 
ATTAATCTGC TTTTACAGCC 
AGTAATGTTT TTCTTCCTTC 
GAGAAATCAG GTGCCTATGT 
ATGTAGATTT TACACCACAA 
TTTTAGAACA TTTTAAAATG 
ACTTTTTATT TAATATGTAT 
CACAGGTATC TTGTGAAATT 
CAGAAAAAAT AATATTGGTT 
TGTAAATCTT TTGGCAGGCA 
GTTAATTACT ACCCTTTTTT 
CAGACTGGTC GACATTTGTT 
ATCTCTTAGC TTATGTGACT 
TGCATGATAT TATAAACTCA 



ATATTACATA TACACACAAT 
CTACAAAACT TTTACAATAT 
GGATTTCCCC CATCAGTGAG 
AAGTTGTAGA TCAGGAAAAA 
AAAACTTGTG GATAGATTTT 
GAATGGTATA AAAACAAAAT 
AGAAAAATTA TATGGTGGTA 
TTTACATTGA GCTAATTTAG 
TTTAAAAAAA CTATTGGCTC 
TTGCCAAGAA TATTTGAAAA 
TATCCTATCA AAAATTTTAA 
CTGTGGCAGT AATTCTAGTC 
AATTTCAGGT GAGATTTTAT 
ATAAAAACTT GATTTAACCT 
TAAAATTTAG TACTCCACAA 
GCCCTCTGAG TTTTCTCTTT 
ACTTTCAGTT CGAGTGGACC 
AAACAAATTT TATAGAACCC 
AACACCTTTT GTTGTTGTTG 
CAGGAGGAGA ATCTCCCAGT 
AATATTGAAC TCTGATCTTC 
TTTAATGACA GTTTTAAACA 
AAATTTCTAT TAAAGAGACA 
AGTTGAGTAC CTCACTTTGA 
AAACCCTTAT GAGATAAAAT 
GCTTAGTAAA CCTCTTTTTT 
TACTTAGTAC GTTGATAGTT 
CTAGCTAGTT CCTTCAAGAA 
TGAGTGGGAT AAAACACTAT 
CTCAATGAAA GGCTGCTGTA 
GACTTAGAAC CAAGTTTCTT 
ATGGCTATGA CTCACATGAC 
TAAAACCACA CAAACTTACT 
AATGAAAAAG GGCATTGTCT 
ATTACTTTTT CCCCTTTATA 
CTATTGTTTT ACTGTAGTTA 
TGTTTCAAAA TTCTCAAACC 
ATTGTCATAC GTAGCAGGAC 
CTTGCTAGAT TGTAGCCACT 
TGACATCATC CACTGTGGAA 
AGTTTTAACC TCATCAAATA 
TGGAAGAAAT GTCACTTTCT 
TTTCTTCTTC TATTTAAAAA 
ATTTGCCTTA TGACGGGGGA 
AAAAGAGGTA ATACTATCCC 
GCATGCGGCT GGGCGGTCCT 
AAATACTGCA GTACGTTGTT 
ATGCAACCTT CTGAATGGTT 
CAGATTAGTA CTGCTTCAGA 
TGAAAATCTT CTTTCTAACT 
AACTAAAGTC GTGTTGGTTG 
AGAAAACTTA CATTTCCTTT 
ACCATTATAT TTTAGCTGCT 
AGTAAATTTA TAGCAAAAGC 
ATAGTAAAGA TGAATAATTT 
TTCGGCTGAG TAACATACTA 
TGATATGATA AAACACATTT 
TGTGAAAAGC AGAAGAGCAC 
CTTCCTCAGT CTCCTTAAAA 
TCATTTTTGT TTAATTGCTT 
GTCCTAGAAA AAAATTGAAC 
TCATTTTTGA GCCACATTAG 
CCTTATGATT TAACCCAAAC 
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3151 TTTTATTTGT AAGTATATAA GGAAGTAATA ATGTTTTTCT AATATAATTA 

3201 GCCTGCTTTA TTTAAAATAT ACTTTGTGTT CTGATAACAC TTTTTTTTTA 

3251 GTATTAAGTT CCACTATAAT TTAAACATTA TAATGTATTC AACAAATGTC 

3301 TGTTGGTTGC ATTGTGTCTG CTACACACTA TTTTAGGGTC TGAACAGTTG 

3351 TAGCATTATT TATCTTGCAG TATTCTGTAG TTAGTAAAAA CTTGCTTTTT 

3401 ACATTTTGAG AAAAGCTGTG TAAGGATCAT GTTACATACA TTGTGCTTTC 

3451 TCTTACAGAG TTACCTTCTT AATAAAATTT TGATATATGT GTATATGTAT 

3501 ATGTTAGAAC ATTTGGAAGA AATATCTAAA AGCATAAAGA AGAAAATAAT 

3551 TTCTTGTAAT CACACCACCC AGAGCTTTTT AAATTTTTTT TCTTAATGTT 

3601 ACGATCATAA ATTCTTCTAT TTCCTATGTT CTGATTATCA GTTTTCTGGT 

3651 AAGGAGTTCT TTAAACAGGA AGCAAGGTGA ATGAATAGTG ACTGTTCAAA 

3701 TGTCACATTA TTTGCTAATC AGTAATTAAA CTGTAAAACA AGACAGACTG 

3751 TATTTTCCTC ATGCTATTAC AACATTTGGT TGTTAATGAT GATAGATCAG 

3801 AATACCTGGG CTTCAGAAAT TTAAATTCCT TTTGTGAAGC TTAACAGTCT 

3851 TTGACAGAAC TTACTTATGG ACTGTCTTAG TGTAAAATAT GCAAATAATA 

3901 AGAAATAAGT CAAAACTTAT GTGAGAGTAG GCATGGTTAC TGATATTACC 

3951 TAAACGTAAG CTTTTTATTT CTATTATACT TTCATAAATA ATCCTTTAAG 

4001 AATCTTGCTT AGGATCTAAA TCAGTCCCAC TCTTGGCAGC TCAAATAGGT 

4051 TCTTTATCCC TTGATGAGAC TTATTCTATT AATATAAGTC ATTGTTATTT 

4101 GAAAGTAACA TTGTGTATGT GTAGTAGAGA TAAGTCAGTT ATTAGGCTTT 

4151 CGTGACTGTA CTGTATTACC TCAAACATAC TGTAGTATCC TAGTGTCTAT 

4201 GCGTAAGATG TTATTTTTTG TCCATAATTT ATGACCTGTT GTAGCCATGG 

4251 GTCAACACAA TGGAATTGAT GGAGACAGGC AGCTAACAAA TCGAAAAAAC 

4301 TGAATCAGCT TCCCTGTGAG GAAGAACAAA ACTATAATGA TTAAAATTGA 

4351 TCTTCAGCCT GATAGTGAAG AGGCAGATAA AGTATAAAAT TGTGAAGGAT 

4 401 ATCAATAAAG TAAACATGGA TCTGTTTAGT AAATCCCTGA GTGCTATAGC 

4 451 CAAGGATTAC CTTTGTTGAG TAAATTGAAT TTAATACTAC TTTTCAAGGC 

4501 GAGATGGTAA ATGGTGAAGC TTCCTATTTA AGTAAATAAT GTCAAGTCTG 

4551 GAAGTATAAG TAGATTCAAA TTAGAATTAG TTTGATATAC TATTGATAGA 

4 601 TTAGAAATTA AGATGACATT TCAGAAATAG CCATCTTTAG GGGTAGATTT 

4 651 CCTATATAGA AACAATCAAG CTCTCTC7VAA ATGTCTCTTC CTTTTTTATC 

4701 AGGAAAAAAG ACTTGGCTTA TCTGGACTGT TAGTTTTACA CTTTTTCTTC 

4751 TTAATTTGTT CAAGATGTTT AAGTAGTTTT AGAGGTCAAA TTTCTTTCTT 

4801 CTACCAACCC TTTATAATGG ATTTGATTCT TTTGGGCCTG AGCCTCCATT 

4851 TACTCCATGA GGGGCCTTTA ACAATTATTT AAATNNNNNN NNNNNNNNNN 

4 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5101 NNNNNNNNNN NNNNNAAAAT AGTAATATTA ATAATAGTTA ATATTTATTA 

5151 GAATTTCCTG TTAGCTGGAT ACTGTCCCTA AGTGGGTTTT TTTGTTGTTG 

5201 TTGTTGTTGT TGTTGTTTTC TTAAGAGAGA GGTATCACTT TTTCACCCAG 

5251 GCTGGAGTGC AGTGGAGTGA TTATAGCAAA TGCAGCCTTG AACTACTGGG 

5301 CTTAGATCCT CCGTCTCACC CTCCTTGGTA CCTGGGACTG CAGGCTTGCA 

5351 ACACCTTGCC TGGCTAATTT AAAAAACAAA ATTTTTTTTT TTTTTAGGGA 

5401 GAGTCTCACT ATGTTGTCCA GGCTGGTCTC CAACTCCTGG GCTCAAGCAA 

5451 CCCTCCTGCC TTGGCCTCCC AAGTAGCTGA GATTACAGGT GCGAGCCACT 

5501 GTGCCTGGCT TGTTCTAAGT GCTTTATGTG TATGAAATTA TTTAAATCCT. 

5551 CATCACAAGT TTATGAAGTA GGTACTGTTA TAATCCCCAT TTTCTAGTTG 

5601 ACAAGACTGA GGTAAGGAAT TGTTAAGGAA AAGTCAGAAT TCCATCCAGA 

5651 TATTTGGCTC ATACTTTAAT CATGAGGCTA AACTGCTTCT CTCTACACGT 

5701 ATCTTCATAG TAACTTGTGT TTTAAGTCTG GTAGAAGCAT AAGAAGTTTA 

5751 AACACAGACA GAATCCTGTG GAAGTTAGTA AATTTCTAGT GAACGATAGA 

5801 AATGATAGAA ATCTCTTCTT CCCCCAAAGT CCCAAGAACA GATTAGTCTG 

5851 CTTTTGACAA GTGTTATCAA AGTAGACTGT TCTCACATAC ACGGGGGACT 

5901 CAATAGGGCA TTCCTGGTGG ATATAATAAA ATGAGTAAAT GCGATAACAG 

5951 GAGGAAATGC CTAGTGTGTT GCTCTTGGAT TAGTTTTGAT ACAACAAAGG 

6001 CAGCTTTGTT GTGAGTCAGT AGAGAGGGTA GTGTAGAAAG GTGGAAGTTG 

6051 GAAGAGTGGC AGATCCTAGA GGACTAATGA TGGGCTTAAA CCACAAAAAG 

6101 TGTCGCTTTG CCATTGAAAT AAAAGTTTGG GGTCTTATTT TTTCAATTTT 

6151 CTCCCTGAAA TTATTTCTTG ACATTCATTA GCTCAGCAGT GTATCTAAAT 

6201 AAAGCTTTTT TGGGTTTCTA TTATAATAGA GGTTTGTTCC TTTTTCTTCC 

6251 CTTTGAAAAG TATCATTTTT TGCACATTAT TTGAAAATCC AGGTGTTATA 
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6951 


TAGCAAATTT 




7001 


TTGCAGACAT 
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CTTAATATCC 
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TGAGGTGGCA 
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AGTATTGCTG 
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GGCATTAACT 




7251 


GGCTGAACAA 
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'W 
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7401 


ACTATCAACT 
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TCTTATCAGT 




7501 


ACTTCCTGCT 




7551 


GTATTTTGGA 
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7601 


TATGTTTTAT 




7651 


TGACGAGGAT 


ri 


7701 


AGTAACTCTT 




7751 


ATTTGGATAA 


F 


7801 


GTGATATTTA 


7851 


TGTTTTTTTT 


ia 


7901 


GGGTAAAATT 




7951 


CTATATTTTT 




8001 


ATCATGAAAC 


i J 


8051 


AAAACAAAAT 


8101 


TTACTAAACT 




8151 


AGAAACATTT 




8201 


AAGTTTGAAA 




8251 


GGCATAAGAT 




8301 


TCGTAAACTT 




8351 


TGCTCACCAC 




8401 


AAAAGTGGAC 




8451 


AGATTGTATT 




8501 


AGAAGGCATC 




8551 


CATTTCTTAA 




8601 


TGAGAATCAA 




8651 


GTGTGACCTT 




8701 


AAAGTGGAGA 




8751 


GAACTAATAT 




8801 


AATTAAAATT 




8851 


AGTGGCCTCT 




8901 


ATCTATTTTT 




8951 


ACCTGCCTAA 




9001 


AGCTCTTTAG 




9051 


AGGTAGGGGT 




9101 


GCATCATTTA 




9151 


TTAAAGAATA 




9201 


TTGGAAAAAG 




9251 


TAAGTCTTTA 




9301 


CCATAGGTAT 




9351 


AATAAGGCAA 




9401 


AAGCCAGTTT 



ATTGCCAGAG GGACATTCTG 
AGATACTTAT TATATTTTTA 
TTAAACCTCT TCTTAAA7UVT 
AACAACTAAA GATTCTTTAT 
AGTAGTATAA AACAAAGCCT 
GAATTTCAAA TCTTCAATAA 
ATTTTCTAAA AATAGCTGAC 
ATGGCTTGTC AGAAATGCTT 
TCCTGCCGCA TTGATTAAGT 
GGTTTCTGCT TTGCGTTTAT 
AAGGTTTTGT TTCTTGTTTT 
TTTGTAGCCC TTTGCAACTT 
TTTCTCTAAC AAATTTTATA 
TATAAACCTT TTTATTCATG 
AATAATTGAA TGTGGAACTT 
ACCTTATCAT CTCTAACTAA 
ACAGAACAGA AAAGAAAACA 
CTGCCTGGTT GTGTATCTTT 
TGCAAGTCTT TGCAGTTTAG 
ATTTTAATAG CGTATGCTTC 
TTAATTGACC ACCTGCTAAA 
TAATCTTTGA TTTGGAGTCT 
TAGAACGTTT ACTTTCCCAT 
AGGGTCACAG AGAGAGAAAA 
CTTAACATAC AGAAGCAAAT 
TTTCTGCCTC TTCCTTCCAT 
GTGTGTCTTA TGTAACAGTA 
CACCATTTCA CATCTTTGGA 
TTTAAATAAC TATATGCATC 
ACCCCTTTGA GAAAAGGCTT 
CTAATAAGCT CAGTTTAAGG 
TTTTTAAGTT CAGTTCAGCA 
ATATTTGTAT TTTTGGGAAA 
CTAGTTCAGT GTTTGTCAAA 
CAGTTGACTT GTGACCAGTA 
AAAATATCAG TATATACCAA 
TTGGTTTTAT TTAAGTACAT 
CTCCACTTCA TGTTTGAAAA 
ACCTGTTTGT AAGTACACCT 
GTCGGGAACA ACTGAGGGTA 
CTGCTAAAGC ATAAGGATGT 
TTCTGCAATT TTCTATTGCA 
TGCAACCAAA GGCAAAGAGG 
TATTGTCCAA AGGCTAAGTG 
ACGTAATAGT TCTTAACCTA 
AGTATACATG TAAACCAACA 
AGTGCTAGAG TTTGAATCCC 
GGGCCTGTTT AACTCTTGAC 
TAATAATATC TGTCACATTG 
ATGTAATGTC CTGAGAACAA 
AGCTGTTCTT ACTGTTATTA 
ACATGTGAAA GATTATTTTA 
TTTATTTTTG TCCCTTTTGC 
CTCAGGATCA CAAAAATTTA 
ATCTAGGATC CATTTTTAGC 
ACGGTTTCAT TCTTTTGCAC 
TTCAAAAGAC TATTCTTTCC 
ATGAATCCTT TTTTTTTTCT 
AATAATGAAT AATTTTAAGT 
TGTTTAGATT ACACATATTA 
GCTTGATCTT TATAAAGTTC 
AACAATCTTC TTAGTAGAGT 
TAGTTCATCG ATTGGATAAT 



CAGGCTCTTT GTAAAATGAT 
TTGGCCCTAA TATTTTATCC 
TAATCCATCT AAGTGTCTGT 
TTGGTGTCAG AAACTCCTTG 
GTTTTTAAAT GTACTTTTCC 
AATCTGGTTC ATATTACTAC 
ACTTTAGTAT GGTTAATTTT 
TGTATCAAGA TTTCCGAGTG 
TTGTAATTTT GGCTATTTTC 
GCAGGAGACT GGTAGTTTAA 
TAAGTTAACA TATGTTTAAT 
TAATTAGGTC ATAAAATGGA 
AATTTATGAA ATATGAAATT 
TATTGTACAG CTCATCATAT 
GTTTCCAATT ACACAGATGT 
AGGATGTGGC TTTTTATTTT 
GTGAATTGAG TAATGGGCTT 
GGTAAACTTC TTTGAGATTT 
ACAGTTAAAT ATGACTGAAT 
TTTTTTGCTA TTTATTTACC 
TGTGAGGCAC TATTCTTGCC 
GCTAACATTC TGGAACTTCC 
CCCTTACCAG GATGGCCATT 
AAAAAACCAT CTGGGGCTAG 
AGGTTGTGAA GGAATACATA 
AATTTTTTTA AAAAGGTTCA 
ATCTGCATTA TGAACTTAAA 
GATTGATCAC AGAGGTAATA 
ATTTTTCATG TAAAACTATT 
AGGCTCCTGC CAGTGTCACT 
CGCAGCAATT AAGGTTGTGT 
AATATATGTG GAAAGCTTGT 
GCAGACAATT TTATTAATGC 
CTTCAAGTTT TAACATGTTG 
TTTTAAAAGG AAAGATTAAA 
GTAGTAAGAG TAAGCATTGT 
ATCTATATAC TATGTCAGTG 
ACATTTCAAA AGCTAAGAAA 
GGGGTAAAGG TACACCCTGT 
AGAATGGGGA TGCATTACTA 
GAGTGCTGGG AGCAAAGCAG 
GCATTTTAAA TAATATGGGA 
GATGGTGATG GTGAAGGGTA 
CATATACATA TGTGTTTGGG 
CTCTGAGAGA AGGTTGTCCA 
ATGAAATTAT TTTAGTGACT 
TGTTCTACTA CTTGCTAGCG 
ACCTTGTTTT CCAAATTTAT 
TGTTGTTGTG AGGATTATAT 
TGTCTGGTAC ACATTAAGTT 
TTAGACATGA GCTAGATAAC 
ATTCTGATGT AGTTCAGTTT 
ATTGATGTCA TATCTAAAAA 
CTCCTGTATT TTATAATTTT 
TAATTTTTAT ATATGGTGTG 
GTGAATAGCC AGTTGTCCCA 
TCACTAGAAA AAATATTTCT 
TTTTAACCGC TGTTACTCAG 
AATTTTCCTA CAGGTAAATT 
GGAAATAATG GATTTGTATT 
CCTGTCTCTG GAAAAACTAA 
TATTTTTACA AGAAAGTTGC 
TTTTCCTGCT TGCTGGAGGT 
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9451 ATTTCAGTAT TGGTAATACC TGAACTATGA GGATGCATGA ATGATGCATT 

9501 TTAGGAATTT GTTTCTGTGT CCATACCAGG CATAATGAAT TAAGTTATCT 

9551 GTTAAAAATA CAGGATTTTT GCTCAATATA CAGTTGTAGA AGAACTCATT 

9601 GTCCAAATTT TTAAGACTTT TTTTTCTTTT TTTTTTTGAG ATGGATCTCG 

9651 CTCTGTCGCC CAGGTTGGAG TGCAGTGGCA CAACCTCCAC TCACTGCAAC 

9701 CTCCACCTCC AGGGTTCAAG TGATTCTGCT GCCTCAGCTT CCCGAGTAGC 

9751 TGGGGACTAC AGGCATATGC CACTATGCCC GCCTGATTTT TTTTAGTAGA 

9801 GATGGGGTTT CACCATATTG GCCAGGCTGC TCTTGAACTC CTGACCTCGT 

9851 GATCCACCCG CCTCAGCCTC CCAAAGTTCT GAGATTACAG GTGTGAGCCA 

9901 CCGCGCCCGG CCAGACATTT ttTTTTTTTT ttTTTTTTTT GCTGTCTTTG 

9951 TCATATTGTT AGTCTTTTGG TTAAGCGATA TTATAACTTA GTCATATGAG 

10001 TAATATAATG CAACATGCTG AATTGTGTGT GTGAGAGGGG GTTGTTTTTT 

10051 GTTTGTTATT TGTTTTTTAA ATAGAGATGA GATCTCACTG TGTTTCCCAG 

10101 GCTCCCTTGA ACTCCTGGGC TCAGATGATA TAGCCTCCTG CCACAGCGTC 

10151 CTGATTAGCT GGGACTACAG GTGTGCACCA CTACACGTGG CTTTCCTGAT 

10201 GAAATTTTAA ATACCCAAAT ATTTGAGCAG AAATAATAGC TTGTGTTTAT 

10251 TGTTTTTCTA CTATCTGTCA AGTATAGTAT TAAATGTTTT ACATAATTTG 

10301 TCTCCAGTCC ACATACAATA CTCTAGTAGA AGTGGGTAAC AAAACCAAGG 

10351 TACTCAAAGA GGTTAATAAG TAACTTGCGC TGGATCACAG AACTAACGGG 

f3 10401 AGGCAGGGCT GGAATTTGAC TCTAGGTCTT TCTGACCTCA AAGTGCAGTA 

^=1 10451 AAGTCATGGA ATTTCTCTAC TAGGCCACCT GGAAGAAAAG TGATCTTTTT 

10501 TCCAGTCTTT TTTGTTACTG TTTTTCAGCC AGGAGATAGT AGAGTTAGGT 

10551 AGTAGAATAG TAGTCACTGG CATCCGGTAG TCAGCCCTCC AAAAAAGTTT 

S3 10601 TTGATTTTTT TTTTTTTTTT TGTCTTAAAC TTGGAAGCTA CTAACTTTCA 

10651 GGTCATACTT TCTTATCATC CAAGAGCTGG ATATTTAGGT AGCAGAAACT 

10701 ATGGAATTAT CCTAAGTCCT CTTGAAGCTT CAGCTGTTAA AATTAATTGG 

n 10751 TTCTGATTAA CACTGTGCTC AAGATTTACA TTTCTAGGAG CCACAGTTTG 

J:^ 10801 ATTGGTCTAA CTTGGATCTA TGTGTTTTCT TTAGCTGGGG AGGAGAAGGT 

10851 ATCTTGATTG ATACCTTCAC CAGGACTGCA TGCAGTGAGG GACAGAAGTT 

2 10901 TCCTTAAAAT AATTGGGTTC TGTTATAGGA AGAAGGGGAA GGAGATACCA 

p 10951 AGTGGGCAAA ACAATCAGGT TCTATTACAT AAATAATAAA CCTAATGTGA 

=7= 11001 CGATAATAAA TGGATAATAT GATTATTTTA AGTTTGGAAA TATACCTGGT 

11051 TATTAGTATT GGATATCTGG TAGTGGGGTT GGAGAAAAAG TCGAGAATAA 

H 11101 GAAAAGACTT AAAATCGTAA AAATTAACTG GAAAAGAGGA TGGCTGAGCA 

[y 11151 GATACATATA TGTTAGATAA TGTTCATAAT GGCAAACCAA CCTGAAGATT 

Q 11201 TGTTTAAATT GTAGTATGTA GCCAGGTGTG GTGGTGCTTG CCTGCAGTCC 

11251 CAACTACTTG GGAGGCTGAG GCAGGATGAT TGCTTGAGCC TAGGTTTGAG 

11301 GCTACAGTGA GCTATGTTTC CACCACTGCT TTCCAGCCTA GGTGGCAGAG 

11351 CAAGACCCCA TCTCTAAAAA AATAAAGTAA AATGAATAAA TTATAATATG 

11401 TTATGACAAA TATAGTTATC TGAAGTCACA GAAAATGTGC ATGTGCATTT 

11451 AATGATGTGA AATAATTTTT AGGAAGTATG AATAAAAAAA TCAACTTTTA 

11501 AGTGTGGCTA GTATGATCTT ACCTGTATCT CACTTATAGA AAATATAAAA 

11551 GGCTGAAGCC AGTCACCAGT TTAATAGTTC TAACCTCTTG TTTACTTGAT 

11601 TCCCTTTTTT CTCCTCCCCA GCAATCCTCA TATAGTTAGG TAAAGTTGGT 

11651 TCTTCATCAG GCTTGTTGCA GAAACCCCTA AGCCTTTTTA CTTAAAGCTT 

11701 TTTGAAACCC AGAAACCCAT CTTTTGAATT CA7W\GTTTT GACTGTTATT 

11751 AGTCTTTTTG TATGTTTGTT GGCCGCATAA ATGTCTCCTT TTTATGAACA 

11801 GAGAAGTGTC TGTTAATATA CTTTGCCCAC TTTTTGATGG GGTTGTTTGT 

11851 TTTTTTCTTG TACATTTGTT TAAGTTCCTT GTAGATTCTG GATATTAGAC 

11901 CTATGTCAGA TGGATAGATT GCAAAAGTTT TCTCCCATTC TGTAGGTTGC 

11951 TTGTTCATTC TGATGATAGT TTCTTTTACT GTGCAGAAGC TCTTTAGTTT 

12001 AATTAGATCC TATTTGTCTG TTTTGGCTTT TGTCGCCATT GCTTTTGGTG 

12051 TTTCAGTCAT GAAGTCTTTG CCAGTGCCTA TGTCCTGAAT GGTATTGCCT 

12101 AGGTTTTCAT GGTTTTGGGT TTTACATTTA AGCCTCAAAT CGATCTTGAG 

12151 TTAATTTTTG TATAAGGTGT AAGGAAGGGG TCCAGTTCCA GTTTTCTGCA 

12201 TATGGATAGC CAGTTTTCCC AGCACCATTT ATTAATATTA AATAGGGAAT 

12251 CCTTTCCCCA TTACTTGTTT TTGTCAAGTT TGCTGAAGAT CAGATGATTG 

12301 TAGATGTGTG GTGTTATTTC TGAGGTCTTT GTTCTGTTCC GTTGGTCTGT 

12351 ATATGTGTTT TGGTACCAGT ACTATGCTGT TTTGGTTACT GAGCCTTGTA 

12401 GTATAGTTTG AAGTCAGGTA GTATGATGCC TCCAGCTTTG TTATTTTTGC 

12451 TTAGGATTGT CTTGGCCATA CGGGCTCTTT TTTGGTTCCA TATGAAATTT 

12501 AAAGTAGGTT TTTCTAATTT TGTGAGGAAA GTCAATGGTA GCTTGATGGG 

12551 AATAGCGTTG AATCTATAAA TTACTTCGGG CAGTATGGCC ATTTTCATGA 
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AGTAGGCTTT TTCTTAAAGT 
TACAAAGAAT CCCCATATAC 
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AGAAAATATT TGAAATGTCA 
AATTTTCTGG TTTTCAGAGA 
AGATGTTGAG TAGTTTCCAC 
AGACACATGC CTAATGTAAA 
CTCATAAAAA TGAGAATAGC 
AGTCTTTGCA GAAATTCATA 
ATAGCTTGAA AATTCTTTAA 
TTAAGTGGTA TAGAAATTTC 
GACCAGTAGT CCATATTTCT 
AGGGAAAATG AAGTTTGCAA 
AACTTCAGAC TTACAGAAGT 
CTGTCACCCC AATTCCTGAA 
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15751 ATATTAATAT TTTACCACAT TTGTTCATTA TGTCTGTATT CTCCAAGTAC 

15801 GATATATGCC ATTATATGTA ATATGTAGCA TTTTATATAG ACATAGGGCA 

15851 TGTATGCACT ATATATTTTT TTCTGAGCCA CATAAAGAGT TWVACGCAGA 

15901 CATGACGTGC TTTTACTCCT AAATACTTCA GTGTGTGTAT TCCCTCAAGA 

15951 AAGGGCATTT TCTTCTGTAT AGCTACCGTA CACTTCTACA CTTTTCAAAA 

16001 TCAGAACATT TACATTGATA CCATACTATG ACATGATCTG CAGACCATTT 

16051 TCCAATATGC CAGTTGTCCC ACTGTGTCCT TTAGTACAAA AGAAAAAAGT 

16101 TTTTTTTCCT GGTCTAGGAG CTAATCCTGG AGCACATGTT ACATCCTGTT 

16151 GTTTTAATCT AGAACCGTTC CTCAGTTCTT TATCTTTCAT AACCTTGACA 

16201 TTTTTGGAGA GTACAATCCA TATATTTTGC AGAATTTCCC TTAGTTTGGG 

16251 TGTGTCTGGT TTTTCCTTAT AAGATTCATT TTATGCATTT CTGGCCAGAG 

16301 TACCACAGAA GTACTGTATA TCTTACCAGA AAGCCTAAGT GGCATTTGCA 

16351 TTTTCTAAAT GATCAATTTT AATATTATAT GGAAAGCAGA GTCAGAGATT 

16401 CTCACATATG TCAAGATATT ATAAGTATTC CTGTTATATT TATTCTCCAA 

16451 TTGCTTTTTC TCAAGAAAAT TTGTGGCCTT TCAGCTAGCT TTTCAAAGTG 

16501 GAAGTTACTA CATAACATTA GGATGGGAGG GGTGGGGAAG AGCTTTATTA 

16551 AAGCTTTAAG ATTGAGCTTT TGAGTATGTG TTGTATGTAA ATGAAAGTGG 

16601 GCATTGATGC AGGGATTGGG CCTTTAAACC TTTGGCCAAG AATGGTATCA 

16651 ATTATTATTA TTATTATTTT TTGGAGTACT TCTGCTAAAA CACTGAAATC 

16701 AGTGTGCCAC TCTCCTTTTA GAAGTTTTAC ACCTTTCCAA GGTACACTTT 

16751 TTTTTTTGGA GACGAGTTTT GCTCTGTCGC CCAGGCTGGA GTGCATTGGC 

16801 GCAATCACAG CCCACTTCAG CCTCTGTTTC CCAGACTCCA GCAGTCCTTC 

16851 CACTTCAGCC TCCCGAGTAG CTGGGATTAC AGGTGCACAC CACCATGCCC 

16901 AGCTAGTTTT TGTAGAGATG GGGTTTTGCC CATGTTGCCC AGGCTGGTCT 

16951 CCAACTCCTG CGCTCAATCT ATCCGTCCTC CTCAGCCTGC CAAAGTACTG 

17001 GGATTACAGG CGTGGGCCAC CACTCCCGGC TTCCAAGGCA GGCATTTAAA 

17051 TGTAATAAAT AGGGAGATAA GCAAGAACCC TGTTGGACCT GGTAGAAGCA 

17101 AACATTTATT AGTACTATTA CGTTGTTTAA AATATTAGCG CCTTCTATAT 

17151 TCATGTCCTC CCAGAATTAT CAAAAAACCT ACTCTATAGT TTATTTGGCT 

17201 TATATCTCAG GAGTAATAAA ATTAGTTAAT AGTATTGGCA TCGTGGTTCT 

17251 TTGTGTATTC CTCCCTTATC CCACCCCAAG TTGATTTCAC ATGATCTCTT 

17301 GATCTAGTCT AAGAATGTTT ATAGTGATTA CGAGAAGTTC AGATTCTGGC 

17351 TTTAACATAT ATAATTGTTT TTTAATCTGT AAACCAAAGA GAATGAGTTT 

17 401 GTTTAAACTA GAAAGATGGC AAGAGTAGTC TGGGAATTTT GTTCCATTCC 

17451 TTAAAAGTCC TATAATAAAA TAAACATATC TTGTGTTTTA TTTTTACAAT 

17 501 TTTTTTAAAC ATTAGTACAG AGTGCCACTT CTTATATTCT ATATCAAATA 

17551 ATGAGCTACA TTTTCAATAA TAACCTCTGA GTAATTTTTG GCATTAAAAT 

17 601 GCTGCATTAC AAAATAATTT GAGGATATAA TTTATAATCA CTTATGCTAA 

17 651 AATCACCTAT TTGAAATTAT GTATGAGGTT TTCAAAGTTT ATAGTGCTTT 

17701 GGAAAAAATT TAAATGTTTC TTTGTTTATG TATCTTTATT ATAAGCTGTA 

17751 GCATATATCA TGTAGTTGTC AAGGATGCTG ATAGATACTT AATATTTAAA 

17 801 GGAGACTTGT CTAAAGTTAG CTGTCCAGGA CTAGAATCTG GGCCTTTTGG 

17 851 TAACAGCTCA TTGCTCTATT TACTTAAATG ATGATTGGAT TCGTTAGAAT 

17 901 TTCTCTATTT TCATAGCTGT CTCTATGGTT CTATGAAAAT ACTGTGTGTG 

17 951 TGCTTATACA TATATGTATA CCTGTAAGTA CAAAGTAGAA AATGAAAGTT 
18001 CATTTTCTGC TTTTGACAAT TGTAATCCCC AGAGATAACC GTTATTAATA 
18051 TGTTGTCTCA TGTTTGGTCA TACTGTTTTC TCTGTATTCT GTGTATTACT 
18101 GTATAAATTT TACACAGTAA TTTGCATATT AAAAATGCTG GTCTACACCT 
18151 GGCCCTTTTT TAAAAACTGC AATTTATTAT GGCCAATTTT TTATACCAGT 
18201 ATATATTGAT CAACCTTATT CTTTTTAACT GCTGCATTTC ATTCATTACC 
18251 AATAGATGAG ACATTTCCAT TGGTTTGAAT TTTTCAGTAT TACAGATAAT 
18301 GGTTCAATTA AATATTTAAG CTTTTGTGCA CTTGTAGi\AT TAATTCCTAG 
18351 ACATAGAACC CTTATATTTT GATAGGTATT TCCAAATTTC TTCCCAAAAT 
18401 GTTTGTATCT CTTTACTTCC ACTCTCAGGT CTAATAATTT TCACTTGGAT 
18451 TATCATATTT CTTACCCAGC CTGTTTTTTA CACTCTAAAC TCTTTTTCTT 
18501 TTCTTTTTTT TTTTGAGACA GCATCTTGCT CTTGGCCCGG TTGAAATGCA 
18551 GTGGCACGAC GACCAACCTG GGCTCAAGCA ATTCTCTCAA CTTAGCCTAC 

18 601 TGAGTAGCTG GGACTACAGA CACATATCAC CATGCCCAGC ATTTTTTTTT 
18651 TTTTTTTTTT GGATTTTTAG TAGAGATGAG GTTTTGCCAT GTTGCCCAAG 
18701 CTGGTCTCAA ATTCCTGAGC TCAAGCAATC CACCCATCTC AGCCTCCCAA 
18751 AATGCTGGGA TTACAAGCGT GAGCCACTGC ACCTGGCCCA AAAGCTCTTT 
18801 TTCTAATAGC AATATAAATT GTCTTTTACA GACTATACTC ATATATGTTT 
18851 CTTCTTTCAG AAATAGGTGT TAAGTGTATC TAACATGGAA TGTATAGCTA 
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18901 TAATTCTCAT TGTGAAACCA TAGCCTAATT TATTTCATAT TACAATTTAA 

18951 AATTCATATT TTTTAGGAAG TTTTCTTAGA TTAATCCGCC TAGTTCCAGG 

19001 TGCTACAGTC CCAAGATTTC TTTCTTTTTA ACAAATTAAA TATAGGTT^C 

19051 ATGACTAGAA TTGTAGTCAA AGAATATTGG AACCTTGGAA CTTCAGTATT 

19101 TGAACTTTAT TTTGAAATAT AATTTGTTAT ATTATAAAAA TATTATAATA 

19151 TATTGCACCT GGAAGTTAGG GGCAGTTTTT TTTAATTCTC TTTGTATCTG 

19201 CTACACTGTA AAGTGCTATT TATGTAAAAA ATTCTTAATA GAAGTCTTCA 

19251 GTTGTAAAGT CTGCTGTACA GACTTTAGAT CAGGGATTGG CAAACTATGA 

19301 GCCATGTGCC AAATCCTGCC CTTCACCTGT TTTGTAAATA AAGTTTTATC 

19351 AGAACACATT CAGACTCATT CATGAACATA TTGTCTATGA TTTATTTTCT 

19401 GCTACTATGG CAGAATTGAG TTGTTGCAAC TGTGTGGCAT CCAAAGCCTA 

19451 AAATATTTAC TCTCCTGGCT CTTTGCCAAC CCGTTTTAGA TTATGAGCAC 

19501 TTTGGCATTA TTATGTTTTT GTTTTCTTTC TATAGCACAC AGTAAGATGT 

19551 TCTGCCCACA TTGTGCATAA TTTATGGGTT TATTCAAGGA TTTATGCAAG 

19601 TGTAGCTGCA AGAAAAAAAC CTAGAAGTGA ACTTGCTAGG TTGAAGAGCA 

19651 TCTGTGTATG TTAAATTTTG TTAGCTTTCG CCTTCCCAAA GGGATTATTC 

19701 CATTTCATAC TTAAACTACT AATTTTGTGA TAGGACTTCT TTCTCCATAG 

19751 CTTTGCTAAA TTAATGCATT CACACACTTC ATCTTTACTA ATCTGATAGA 

19801 GGGAAATGAT ATTGTGGATT TGATTTGCAT TTCTTTTTAT GTGTTAGCTT 

19851 GAGCTTATTT TCATATTTAA AAGCCAATTG TATTTCTTTT TCTTGAGCTA 

19901 TCTTTTAATG TCCTTCCTGA TACATTTCTG AAGTCTGTGA TACTCATATA 

19951 AGATATATGG TGAACATGTG TCAAAGATTT ATTTGACTCT AATGAGGGAA 

20001 CCCGCCTGAT GACAAGGCTG ATTGAGAAGA GGATGTGTGA GATGAAGTGT 

20051 ATATCATCAG TGAAAGAAAG CAAATTCTTA CAGGGCAAAA ACAAAACCAC 

20101 AACTCTAAGG GTTATTGTTT CTACTGGACA GAATTCATTT GCATTTTACC 

20151 AGATAAAAAT TACTATTTTC AATTTATCTT TTACAAATCA TTTTCTAATT 

20201 TTACAGAGTC TATTCCCTAA TCAGTAGTAA ATAGTCTTCA AAATTCTCCG 

20251 CAGCGTCAGG TGACTATTAT GCAGGCTAAT TGTTGACACT CGGGCTTGAC 

20301 TTTAAGAGAA CATGCCATAA TCTTTTGGCC TTACTTCCAA GTTTTGGATA 

20351 ATTTTTCTTA ACACATTTTT CTCTAATTGC AATGATTTCA AGTGATATTA 

20401 TTTCTTTTTT TTAAATTTTT TTACTATTTA TTGATCACTC TTGGGTGTTT 

20451 CTCGGAGAGG GGGATTTGGC AGGGTCATAG GACAATAGTG GAGGGAAGGT 

20501 CAGCAGATAA ACATGTGAAC AAAGGTCTCT GGTTTTCCTA GGCAGAGGAC 

20551 CCTGCGGCCT TCCACAGTGT TTGTGTCCCT GGGTACTTGA GATTAGGGAG 

20601 TGGTGATGAC TCTTAATGAG CATGCTGCCT TCAAGCATCT GTTTAACAAA 

20651 GCACATCTTG CACCGCCCTT AATCCCTTTA ACCCTGAGTT GACATAGCAC 

207 01 ATGTTTCAGA GAGCAGGGGG TTGGGGGTAA GGTTATGGAT TAACAGCATC 

20751 CCAAGGCAGA AGAATTTTTC TTAGTACAGA ACAAAATGGA GTCTCCTGTG 

20801 TCTACTTCTT TCTACACAGA CACAGTAACA ATCTGATCTC TCTTTTCCCC 

20851 ATATTTCCCC TTTTCTATTT GACAAAACTG CCATCCTCAC CATGGCCCGT 

20901 TCTCAATGAG CTGTTGGGTA CACCTCCCAG ACAGGGTGGC GGCCAGGCAG 

20951 AGGGGCTCCT CACTTCCCAC ACTGGGCGGC CGGGCGGAGG CGCCCCCCAC 

21001 CTCCCAGACG GGGCGGCTGC CGGGCGGGGG CGCCCCCCAC CTCCCAGACT 

21051 GGGTGGCCGG GCGGAGACGC TCCTCACTTC CCAGATGGGG CGGCTGCCGG 

21101 GCGGAGGGGC TCCTCACTTC TCAGATGGGG TCGCGGCTGG GCAGAGGTGC 

21151 TCCTCACCTC CCAGACAGGG TGGCGGCTGG GCAGAGACGC TCCTCACCTC 

21201 CCAGACGGGG CAGCCGGGCA GAGGCGCTCC TCACATCCCA GAGGGGGCGG 

21251 CCGGGCAGAG GCGCTCCCCA CGTCCCAGAC GATGGGCGGC CGGGCAGAGA 

21301 CGCTCCTCAC TTCCTAGACG GGATGGCGGC GGGGAAGAGG CGCTCCTCAC 

21351 TTCCTAGATG GGATGGCGGC CGGGAAGAGG TGCTCCTCAC TTCCTAGACT 

21401 GGGCGGCCGG GCAGAGGGGC TTCTCACATC CCAGACGATG GGCAGTCAGG 

21451 CAGAGACGCT CCTCACTTCC TAGTACAGGG TGGCGGCCGG GCAGAGGCTG 

21501 CAATCTCAGC ACTTCGGGAG GCCAAGGCAG GTGGCTGGGA GGTGGGGGTT 

21551 GTAGCGAGCC GAGATCACGC CACTGCACTC CAGCCTGGGC AACATTGAGC 

21601 ACTGAGTGAG CGAGACTCCG TCTGCAATCC CGGCACCTCG GGAGGCCGAG 

21651 GCGGGCAGAT CACTCGAGGT CAGGAGCTGG AGACCAGCCC GGCCAACATG 

21701 GCGAAACCCC GTCTCCACCA AAAAACACAA AAACCAGTCA GGCGTGGCGG 

21751 CGCGTGCCTG CAATCCCAGG CACTCGGCAG GCTGAGGCAG GAGAATCAGG 

21801 CAGGAAGGTT GCAGTGAGCC GAGATCGCGG CAGTACAGTC CAGCCTCGGC 

21851 AACAGAGGGA GACCGTGGAA AGTGGGAGAC GGAGACGAGG GAGAGGGGGA 

21901 GACCGTGGAA AGCGGGAGGT GGAGACGAGG GAGAGGGAGA GGGATTATTT 

21951 CTGTATGACT TAATAATGAA TTTCTAAGAG GTCACTTAGC TCACTGTTGT 

22001 CTCTTCTT^AA ACATACTCAT CTTTCCTTTT CTCTTCTGTA GGAACTCATT 
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AAATGGAGGC AGCATTAACA 
AACCAATTCC AGGTGTTGGT 
GTGCGAGAAA AATGTAAAGA 
TAGTAAAAAT TTTTAAAAAC 
ATTTCTGCCA ATGATCTCAG 
AATGTCTACA TTTCATATGT 
ATTTTTTTAG TTTACTTATA 
AGCTAAAAAA CAAAGAAATA 
AAAAATACAG CACAGGAGTT 
GGAGGTCAGC ATGATTTTGG 
AAGGAAACTC ATTGCATAGC 
GGGGGCAGAT GGTCTTGTGG 
CAGCATTTGA TTCTGAGATA 
TTGAGTGTTG TAATGGTTTA 
AAAAGTACAA CTTATGGATC 
ATATAATCCA AAATACATTT 
GACGGAGTCT CAGTCTTCTG 
GCTCACTGCA CCCTCCGCCT 
CTCCTGAGTA GCTGAGACTA 
TTATACAAAA TACGTTTTTT 
GACTGTCGCC CATTCTGTTG 
GCTCACTGCA ACCTCTGCCT 
TCCTGAGTAG CTGGAATTAT 
TGTATTTTTA GTAGAGATGA 
GACTCCTGGC CTCAAGTGAT 
TCTTCTTGAA AAATAAAATT 
GTGTTGGAAA TTTACTTGTC 
GCTTCCATAT TAATATATAT 
TATTGTTTAT GAAAGGTACA 
TTTAGGGCAT ATCTAAAAAT 
ATCCAGTGAT TCTTAAACTT 
AAGATATTGA AGAGCTCCAA 
GGATATTTAC CATATTAGAC 
TTCATTTAAA AATAACAGTA 
CATTTTTACG AAACTATATT 
CTACATTTGT TATAAATCTC 
GATTCTCATA TCTGCTTTTG 
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ACTTATATAA TATAATACAA 
CTTTGATCTA TTCAGAAAAG 
CAGTGATAGA TACAAGTTTC 
AATTCTATCA TTGGAAACAA 
GATTACTTGG TTTCATTATT 
GAAAACCATG GTTTGATGTC 



GTTCTACACA TTTACTGGAT 
ACATATGATG ATTTCCATAC 
CAGAGAAAGG CATAGACGGG 
ATAGTGCATA ATTAGATCTT 
GCTGCCAAAT GTTTACATTT 
GGTACATGTT TTTTTCTTTT 
CCCTGTAACT TTCCAGATVAG 
CAATAAGAAG ACAAAATAAG 
GGGGGGAAGA ACAAGCCAAG 
GCTTTGAGCA GCCCACCAGC 
TCTTACCTAT GGAAAAAGAA 
GATTTTGCTG TTTTCTTTTA 
TTTCTCAATT TGGCTCCCAA 
CTGTTTTTTT TAAAAATGGC 
CTTTTTGTTT GTGGTCGTGA 
TTTATTTTGT ATTTATTTAT 
CCCATGCTGG AGTATAGTGG 
CCTGGATTCA AGCGATGCTC 
CAAACGTACG CCACCATGCC 
AAAAAACAAT TTTTTTTTTG 
CCCAAACTGG AGTGCAGTGG 
CCCAGGTTCA AGCGATTCTT 
AGGTGTGTGC CATCATGCCA 
AGTTTCGCCA TGTTGGCGAG 
TGGCTGACCT CAGCCTCCCA 
CCAAATCTCA AAAGGCCCTA 
AATGAAAATG ACTATTTACA 
GTGTGTGAAC CTGAAATTCA 
GCCTCTGAGA TTCATCAGAT 
AAAATACAGT ACATGAAATC 
TTTGCTCTCA GATCCCCTTT 
GGAGGCTTTG TTTACGTGGT 
ACTGAAACTG AGGATTTTAA 
ACAAAACCCA TTACATGTTG 
TTCAAAAATT AGTGAGAGAA 
ATTATTGTCT GGCTTAATAA 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN 
CTGATTTATG TGGATCTTTT 
TATTTTAAAA ATCACATTTG 
ACTCTAAGTA TTGGGAACCT 
CTAAAATTCT GATTTTTACT 
ATACACATTT ATTTAACTTA 
GAGAAAATAC CTGTCAAATT 
ACTCTTTCAA GTAAAAATGG 
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25201 CATTCCATGT AAGAAGTGTC TAGTTTATTA TGCAACTCAA ATAAATTACG 
25251 CAAGTGCTTT TCTTTAGGAC ATAACTTCAT ACATACTTCC ACAAGCAGCA 
25301 GATGTGTGTA GTTATGCATA GTTCCTTATG CATGGTTCTT ATTTCATCAC 
25351 ACAAAATATT AAAAAGACTC AGTGATTGAG ACGTAGCAGT TTTTACTGCT 
25401 TCATCAAAGA TGCTCTTATT TGAAACTGGC ATAATATGAT TTATTTATTT 
254 51 GATTTTACTG GGAAGCATGG CAGTCAAGAA TGTAATGACT GCCAGTACAT 
25501 TTGAGTGCCA CTGCTTGATT TTTGCTATGG AGTCAGCAAT TTTGCCACTG 
25551 GTTTTGCATT TTCAGTAAAA ATGTCAACAC AGTGAAAAAG GCACATAATG 
25601 TCTTGTATTA TTTTGTAAAC AGTTTTATCT TGCAGACCCC TTGAAAAGGT 
25651 CTCGGGGATC CTCCAAGGTG CCAGTAGACC GTACTTTGAA AATCACTATT 
25701 TTAATCCAAA GTGCCTAGAT CAGACACACT ATAAATCCTG TGTCTTGTAT 
25751 GATCATTAGG TAAATACATT TGTACTTAGA AGTATACATT CAGAGACATT 
25801 AACAGTATTC AGGTTGGGAT TTAAGTATAT TTTAAAGTGT GGTACCTAGA 
25851 GAGTATCCAT GACACTATGT TCATAAAATT TTAGAGAAAA CTGAGATCAA 
25901 AGGAAACCAA AACAGGCTGG TCATAGTGGC TCATGCCTGT AATCCCAGTG 
25951 CTTTGGAAGG TTGAGGCAGA GGATCGCTGG ATCCCAGGAG TTTGAGACCA 
26001 GCCTGGGCAA ATATGGAGAC TATCTCTACA CAACAAAACA AAAATTAGCT 
26051 GGGTATAGTG TCTTGCGCCT ATAGTCCTAG CTACTCGGAA AGCTGAGGTG 
26101 GGAGGATCCC TTGAGCCTGG AAGTTCTAAG TTACAGTGAA TTATGATTGC 
26151 ACCACTGCCC TCCAACCTGG GTGAAACAGC AAGACCCTGT CACCCTCCAA 
26201 AACAAACAAA AAACACTTTT TTCTCTGAGT ATGTAAATGG TTAGTGTACA 
26251 GTCCTTGAAA ACATTGCAAA TAGTATAGCA ATATATGAAG TAGCCAGTAT 
26301 GTGTCCTAGC TAATTTTATC AATCATCTCT TCCTAGACCA ATCAAATATT 
26351 TTTCAATATT TTGATCCATG CTTATATGAA CAAGATTTTT TAAAGCTGGA 
26401 AAATTCCACA CATTTATATA CTTACTATTG TTCTTAAAAT TAATTTTTTT 
26451 TTTTTTTTTT TAAGCAGAGT CTTGCTCTTT TGCCCAGGCT GAAGTTCAGT 
26501 GGGGCGATCT CGACTCCCTG CAACCTCTGC CTTCCAGGCT CAAGCAGCTC 
2 6551 TCGTGCTTCA GCACCCCAAG TAACTGGGAT TACAGGCATA CGCCACCACA 
26601 CTGGCTAATT TTTGTAGTTT AAGTAGAGAT GTGGTTTCGC CATGTTGGCC 
26651 AGGCTGGTCT CAAACTCCCG GCCTCAAGTG ATCCACCTGT CTCAGCCTCC 
26701 CAAAATGTTG GGATTACAGG TGGGAGCCAC TGCGCCCGGC CTACATTAAA 
2 67 51 TTTTAAAGCC TTTCTATGTC AGTGCATATA CCCAACCTAA TTCTTTTTTT 
26801 CCGTGAACTT TTTTGTTATG CTTGTAGCCT TCCTACCCCA GATTATTTCG 
26851 AAGCAAATTG TCATTCTGTA ATTTCAAATA TTACTATTTC AGTATTTTAC 
26901 AAAATGGTTG CAGTTTAATT GTTGTTCCTT TTTTATTTAT TAGCTTGCAT 
26951 ATTTCTATAG AGAGTTTACC CCACATCAAC CATTTGGATT ACCTGAAGTA 
27001 AGGGTGGTAC AGGAAAGGGA GAAATCTTGA AATACTAGGT TCCTTAGCAT 
27 051 CCTCAAAGTT GACCAATGAG ATTTTTTGCT TGTTTGGTTG TTTTTTTCTG 
27101 TGTCTTCTGG ACTCATGGAT TTAAGTATAT TTGTGGTTTA ATCATCACTG 
27151 TTATTATTCT TATTGATGTT CATGTTATTT TAGATTAGTG GGAGCTTTTT 
27 201 TAGTTTGCTA TCTGTGTCCT TCGTCATGTC CTTAGATAAT CCTAATCCTA 
27251 ATCCTGATTC ATCGTAGACA TTTCCCGCAG CAAACCTGGA ATCAGCCATT 
27301 TCTCAAGGAG CTCTCTGATT CCATTGAAGG AAAATATAAT ATAGGTACAA 
27351 TCTAGGCACT AGGTGATACT TGTTACTTCT GGGTTGGCTA TTGTTTCTAG 
27401 CCTCCTAAGT TTATATGACT GTACTAATTT GAATTCATAA CTATGGGACT 
27451 AAACTTCTAA TTCTTA7VATC TGCATTTCCT TTAAGTCATG CCAAAAATCT 
27501 GAACATCACA AACATAGTCA TTTCGTTTAC CCCACAATAC ACACATACAA 
27551 CATTGTCAGT ATAACAGTAC CAACACCATC TCCAACAATA TGCCTACTGA 
27 601 AAAATTTTAG GTAATCTGTC TCCAGCCTCC CAGGTAGCTG GGACTGCAGG 
27 651 TGCACACCAC CATGCCTGGC TAATTTTTTT rprp-p^^^rprprp^ TTTTTTAAGA 
27701 GACTGGGTCC TTGCTATGTT ACTCAGGCTG GTCTGAAATT TCTGGCCTCT 
27751 AACAGTCCTC CTGCCTTTGC CTTCCAAAGT GCAGAGATTA CAGACCTGAG 
27801 CCACCACGTC TGGCCTATCC TTTATTTATT CCACCAAAGT TATTTATACA 
27 851 AATTACTTTG TTGTAAAGTC CCTTGGAATA GTTTCTTCTG TGGCATTATG 
27901 TTACCAGTTA GATGCACCTT TGATTCATTT AACTTTACTT CAATTTTTAA 
27951 GGTTTGCTTT TTAGATTTAG TTTTGTTTTA TTATACATAT ATGAAGTATT 
28001 TCCACGGTTC CAAAGTTAAA TGAACAAAAC AGGCATGTTC AAAGAAGTCT 
28051 AGTTTCTATC TCTGTCCCAT CCAACCCATT GTCTTCTTCC CCTTATAAGT 
28101 AATAATTTAC ATTTTTAACT TGTGGTTTAT CTTCTGATTT TTAAAAATAT 
28151 AAGCATAAAT ATTTATATTC CTGTCTTTTA GCATGCTTTT AGCCATCTTG 
28201 CTTTTTTCCT GTATAATGCT AAATATATCT CATTCTTTTT AATTGCTGCA 
28251 GAATTTCTCA TTACATAGGT ATACTGCAAT TTATTTATCT GATGCTATGT 
28301 TGATGAACAT TTAAATGATT TCCAGATTTT AGGAACGGTG ATGATTGAAC 
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28351 TCTCTGTACA TATATCTTTT TTACTTGGTA CACTCCATCA AGCAACTACT 
28401 TAAGTGACTG ACTATGATGC TGTGCAAGCA GTTATATAAA GAAAACAGCA 
28451 GTGACTCAGC CTGAAAACGG CTTAATATTA TCATGTTTTC TTACACATTA 
28501 TTTTTATTGA GGAAAAGCAA CATGGAGTTT AGTGATTATT TTTGAAAGAA 
28551 ATAACCTATT TCTAATTCTA AAGAATGGTT ANNNNNNNNN NNNNNNNNNN 
28601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
2 9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
2 9051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
2 9101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGAGTCTAG 
2 9151 CTCTGTCACC CAGGCTGGAG TGCAGTGGCA CGATCTCTGC TCACTGCCAC 
2 9201 CTCCGCCTCC CGGTTTCAAG TGATTCTCCT GCCTCAGCTT CCCAAGTAGC 
29251 TGGGATTACA GGCGTTCGCC ACCACACCCA GCTAATTTCT GTATTTTTAG 
I J 29301 TAGAGAAGGG GTTTCACTGT GTTGGCCAGA CTGGTCTTGA ACTTCTGACC 

29351 TCGTGATCCA CCTGCTTCGG ACTCCCAAAG TGCTGGGATT ACAAGCGTGA 
'IZ, 29401 GCCACCACAC CTGGCCAAAA ATATGGGTTT CTAAAGCAAC AGTCCTAGTA 

J^' 29451 CAACAGAAGA GAGGTGTTGA CTAGTTAGGG ATTTAGGTTT AGAAGTACAT 

C3 29501 TCTTAGTAAG AGAGGTGAGA CTTACCTTCT TGTGTTTTAG TATAGTGAGA 

29551 TCTGGATCAA ATCTATTACT CTTATTAATC TCCTAACTTC CTACACTATA 
29601 TCCAGTAGAG GACACTTTTG CCTTACACAG TAAAGAAAGA GCCTCTGGAC 
I'] 2 9651 TCTACCAATG GGATCGGAGC TCTCCAAACC TGCATATTAA AAGGCCTATA 

^'■4 2 97 01 AGTTTTGGGG GGTCCCTTTG TCCACATGAT TATTCTGTAA TACATTGTAT 

fU 29751 TTATGGACAT GGTATTATTA TACACAGATC CTGTCTTTTA AAGAACATTA 

s 29801 TAATCCACTT AACTGCTAGG ACCAGAGAAT GACCGATAAT TCAAACCATA 

29851 TTGTCTTACA GAAGACATAT ATAAAAGATG GTTATGTGTA CCAATTGAGG 
^r|^ 29901 TTCAAATTTG ATTCAATTTA AAACAATCTA GGCCAGATTT TATATAGTTT 

;^ 29951 GTGGACCCTT TGCACTCAAA TCTCAAGGTT CTTATTAAAA TGCAGATCTT 

30001 GGCTGGGCAC GGTGGCTCAC ACCTGTAATC CCAGCACTTT GGGAGCCCAA 
IJ 30051 GGCAGGTAGA TCATTTGAGC TCAGAAGTTC AAGACCAGTC TGGCCAACAT 

[3 30101 AGCGAGGCCC AGTCTCATTG AAAGAAAAAA AATTTTTTAA TAAAAAATAA 

12 30151 AAGCAGATCT TGGGTAAAGA CATGTAGTCT GGTTTACAGG TATTAACAAC 

30201 TGTCTGTAAT GTAGTGATTT TGCTCCAGAC TTACCTTTTC CATTATTTAG 
30251 TTCTGAAATT ACTGTTCTAT GTATGGTAAA TGAGAAAAAT TGCTAGATTC 
30301 TAGAACTGTG GCTTCTATTC ATAGTTGGAA AAATGAAGCA TAAACATTTC 
30351 TAATTTCAGA TCAACAGCAA AAAGAAAGAA TCAGCATGGG AAATGACAAA 
30401 AAGTTTGTAT GATGCGTGGT CAGGATGGCT AGTAGTAACA CTAACAGGAT 
30451 TGGCATCAGG TAAAGAAAAT TTTTCAAGCA ATCCTTTTTT AGTTAACAGA 
30501 AGTATAAACT GTTCTTCCCT CCTTCCCTCA ATTTTTTTTC AGGTACCATT 
30551 GGATTTTAAA AAGCATTTGT TTCTCTTCTT CAAAAAATCT CCTTAAATAT 
30601 AAGACTAGGA GGCAGAGGCT TCCAAGTCTA GTCTTGGCTC TATCACTTTA 
30651 CGTGTTTATC CAGCTTGGTT GATCTTTCTG GACTCAGTTT CTATATCTGT 
30701 AAAATAAGTG GTTTGGATCA GATGATCAAT AAAGTATCTT TTGATATTAA 
30751 CATCGTAATA AATAGCTAAT ATTTCTTGAG TGCTTCCTAT GTNNNNNNNN 
30801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNTGGAAGAT 
30851 TATGTTCAGA AGACCATAAA AATTAAAATT TTTGTGGAGA ATAAAGTACT 
30901 GATAATTCTA ATTGGCATGC ATAGTAATTT TATGGCCTCT GTGTATGTAA 
30951 CCCACTGATC TCTTTATGTA AGAAGGACCC AGATTTGACC ATAAATTTGT 
31001 GTATTTTTTA TATTCTCACA ATAAAATAAT CTTGATATAT GGTTTTCTGT 
31051 AATTTAAGAA AATATTATTC CTATGAGTTT CAATAATTAT TTCTAATGGA 
31101 CATTAAATTT TAATGAAATT GACATCATTT ATAAGTCTGT TAATTAAGTT 
31151 ATCGATTGAA AATTAGATTT GTGAACCTCC TGCCAAGTAG CTGTCTTTTG 
31201 AAGATATTTT AGTATCTTTT AAACATTGTT TTTCAGATCA CAATTAATTT 
31251 GAATGATGTA ACTTTTTAAA ATTCCAAACA AAAATAGCAC TTTTATTGTA 
31301 AAAAATAACT CTTTACAGTT TATAACTAAA ATTTGAAAAT CTTAAATTTA 
31351 TATGTAGTTC ATAAATGACC CTTTATTTAG GAGTCTCCTG CTTTCTACTT 
31401 GCCTTTTAAC TAGATTGTTC TCGACTCCCA AAAAATTGAC TTAATTTTTT 
31451 TACCATCTCC AACATGTTTT TATAGGGGCA CTGGCCGGAT TAATAGACAT 




31501 TGCTGCCGAT TGGATGACTG ACCTAAAGGA GGGCATTTGC CTTAGTGCGT 
31551 TGTGGTACAA CCACGAACAG TGCTGTTGGG GATCTAATGA AACAACATTT 
31601 GAAGAGAGGG ATAAATGTCC ACAGTGGAAA ACATGGGCAG AATTAATCAT 
31651 AGGTCAAGCA GAGGTAAGTC TTGCTTTGTC TCAAGATGAA TTAATAATTG 
31701 ATATAGCAAA ATGTTTCCAA TTCATTTAAT TATAGAACTA ATCACATATT 
317 51 AGATGATTAC ATACACATCA AATGGATCCA CCCTCAACAC ATTGCAGCAA 
31801 GAAAGAATTA AGTGCAATAT TGTTTCAAGT AGCTTTTTTA TTAGTTAACT 
31851 GCATAGTCAT ATAACAAATC CTCTGGATTG TGGTGCAAAT ATATTTGAGC 
31901 TGTAGTAGAA AAGAAGTGAT AGTTATTGCA GTAAGATCTG TGTAAAGTTA 
31951 CTAAGAAGTC AAGTTATTAA AACTAATATA TTACTAAGAT TGGGAAGTTT 
32001 GAATTATGAA AGTATTATCA AATAATTTAG TAAAATCAAC CTACGTAGAG 
32051 ATACATTGAA GATAATCAGA CATTTTTATT TGTGGCATTA CAGCATTTAA 
32101 ATGATTGATT TACTATGATC TACAAAGAAC ATTTTAGAAC TTAGGATGTT 
32151 ACATGTATAT TTTTTACATG ATGACATGGA TATATTTTTT AAATTTTGTT 
32201 TTAGCTGAAC TTTAGAGCTA AAAGGTATAC ATTTGCGGTA AGATGAGTAG 
32251 TATGCTGTTT CTCACCTGGC TTAATTGAAT TGAGTTTAAT GATCTGGAAA 
32301 GTTGCAGCAG AATGAAATCT GAGTGGTGAT GCAATTTGTT TCCACTGTTT 
32351 CCAAAAAGTG GTTTGTAGGC AGAGATTGAA GTATAGCTGA GATGTGTTGG 
32401 TAACAAGACT TTAGGGATTA GGAAAAAGAT TAAATGTGCT CAGGGTTCCT 
32451 TGGTATATGT AGGCATTAAT TTTTGGACTC TACTTAAATA TTTTGTTCAT 
32501 ATAAAGTTTT TATTATTGTG GAAATAAACC AGGAGACTTT TACACATTTT 
32551 ACTGAAGTTT CTTTTCTTTC tTTTTTTTTT tTTTTTTTTT TGGCCGGTGG 
32601 GATGGAGTCT CACTCTGTTG CCCAGGCTGG AGCGCAGTGG CACGATCTCG 
32651 GCTCCCTGCA ACCTCCGCTT CTGGGGTTTA AGCGATTCTT CTACCTCAGC 
32701 CTCCCGAGTA GCTGGTATTA CAGGCGTGCG ccaccatgcc cagctaattt 
32751 TTGTATTTTT AATAGCAACG GGGTTTCACC ACATTGGCCA AGCTAGTCTC 
32801 GAACTCCTGA CCTCAGGTGA TCCACCCGCC TCAACCTCCC CAGTGCTGGG 
32851 ATTACAGGCG TGAGCCACCA TGCCTGGCCG TTTACTGAAG TTTCTTATGA 
32901 CAAGCATTTG CATTAGAGGT GCAATGTAAA TTAAATTCAT ACTCTCGAAC 
32951 TATTTTCTTT TTAGGGTCCT GGTTCTTATA TCATGAACTA CATAATGTAC 
33001 ATCTTCTGGG CCTTGAGTTT TGCCTTTCTT GCAGTTTCCC TGGTAAAGGT 
33051 ATTTGCTCCA TATGCCTGTG GCTCTGGAAT TCCAGAGGTA AGCCAAGTAA 
33101 TATTTAGTGT CATTAAACAT TATTATGATG CTTATCTTTT TGACCTTAGT 
33151 GATAATAAAA GTTGGCTTTT CTGGAGGGAG GGGATAGTTT GTTCATAATA 
33201 TGAAAAAAAA ATTTTTTTAA GTATAAGCTG ATGGTAGACA TCATTGAAAA 
33251 ATATTGTTCC GCATAGTCAT TTGGTCATTT ACTGTGAAGG CTGATTTTTT 
33301 TTTTCTCTCA CCACTAATTT AACACATGAC TAGGCAAATT TTCAGACTAT 
33351 TTAGTTAAAC ATCAAGAGCC TGGAAGAAGT ATCTTGTGAC CTAATGTTCT 
33401 TTGACGGGTT AGTTGTTACT TTGCTGTTAT GACCCTGAAT TTTTTTTTTT 
33451 TGAGACTGAG TCTTGTGCTG TCGCCCAGAC TGGAGTGCAG TGGCGCAATC 
33501 TCAGCTCACT GCAACCTCTG CGTCCCAGGC TCAAGCAATT CTTGTGTCTC 
33551 AGCCTCCTGA GGAGTTGCGA TTGCAGGCAC CTGTCACCAT GCCCTGCTAA 
33601 TTTTTGCATT TTTTTGTTTG TTTTTTTTTT TTAGTAGAGA TGGGGTTTCA 
33651 CCATGTTGGC CAGGCTGGTC TCAAACTCCT AACCTCAAGT GATCACCCGC 
33701 CTCAGCCTCC CAAAGTGCTG GGATTACAGG TGTGAGCCAC CACACGTGGC 
33751 TATGACCCTG ATTTTGATTC ATTCACTTTT TATAATTACC TTTTGATTAG 
33801 ATAAGTTAAT TATTCTTGAA TTTGGCCATT TTATGCTTTG AGAAAGTAGT 
33851 TAATCACAGT GGGTCAACAG TACAAACTTT TGGGTTTTAT TTTTCATCAC 
33901 AATAAAGTAG AGTTATACAT AGGATTGATT GAACTTGATT TGAACTTATC 
33951 TCTTCTCTTT TATTTTTCTG GAGTTAAATA AGTTACCAAC TTTTTCCTAA 
34001 TACATTTCTT TTTAAAATGG AATTGTATTG ATCCTTTAAG TTTGTATTAA 
34051 GAATATCTTT CATAAAAAGC AATATCATGC AGTATATAAC AGTTGTTACT 
34101 CATTCTTGAT ACATAAAAAA CTATTGCACA TAATTACAGG ACCTCAGAGA 
34151 AAACATAATA TTCTTATTTC TAACATAATG GCCAAAATAT ATTTAAAATA 
34201 TTATGCTTAT TTTTACAACA GAAATATTCA AATTTGCCCT TTTTTTGGGT 
34251 ATGTAATTAT AATCCTTATA ATTAAGGTCT GTATTCATTT TAACATGGCC 
34301 TGATATTTTG ATTTTGGCCT GAGATAGTGT TGCCCTCTCT CCTTTCTTGG 
34351 GTAGAGAATT AGATTATAAT ATCAATTTAT TATATGTAGC ATAATAGGCA 
34401 AGTTTTCGAA AAATTAACTG TAAATTTTTC TGTAGACTGC TAAAATTTGC 
34451 AAGGTTGTTT TTGTGCATAA AACAAGAAAA TAACTTGGAT TCGTTACATT 
34501 CTCATGTTTC TTAAAGGACA TTAAGCTGCC TTAATCTTTG CCTTGTAGAT 
34 551 TAAAACTATT TTAAGTGGAT TCATCATCAG AGGTTACTTG GGAAAATGGA 
34 601 CTTTAATGAT TAAAACCATC ACATTAGTCC TGGCTGTGGC ATCAGGTTTG 
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34 651 AGTTTAGGAA AAGAAGGTCC CCTGGTACAT GTTGCCTGTT GCTGCGGAAA 
34701 TATCTTTTCC TACCTCTTTC CAAAGTATAG CACAAACGAA GCTAAAAA7VA 
34 751 GGGAGGTAAG TGTCTTTTGT AGTTAATTTG ACTGAAAAAT ATATATTATA 
34 801 TAGTATTTAT TTAAGTAAAG AATTTCTTAG TGTAAAAATA ATAAATTCTG 
34 851 TATTCAGATA AAAAATTTTG AGATTTGTGC TTCTGTTTTT CCTGAATAAT 
34 901 CTATAACATC TTTCTAGAAT CCATTCCCAG TGCTGCTCAG TTCGTCTTAC 
34 951 ATTTTAGAGA AGCTTTAGAT AGACAGCTGG TGTCCATTGG GTTTCAGCTG 
35001 CATTTCACGA AGATCTTCCT GTTATCACTT TACCTTACAT CTTTCCTCTT 
35051 CTGAAGTGTT TTCTAAGCTT AGCTTTGTTT TTCACTCTTA CTTTCAACAT 
35101 TAAGAGGTTG GGAAATCTTA ATAGCTATGT TTTCCTCCTG GAGGCAGTGT 
35151 CTGGTGCCAG TGTAAGTGGT GTGTGATATG AAAAATGCTA TCCAGTGCTA 
35201 TGGGGAAGTT CTGAGGGCCT TTAGAAGCTC TTGAAGTTTA AATCAGAAAT 
35251 TCACATTAAA GAGATTACAG GAAATCCTTT TCATTTGATT GTTTAAGGCA 
35301 ATTTCCTTTA CCATTTCTTT AGGCCAGCCT GAGATCTTCT ACAAGACCTT 
35351 GAAACCTTAT ATATATTATG GATTTCCTCT GATGTTTCCA TATTGCTCTG 
35401 GGCATTTTCC TGAATCCTTT ATATTAGCTC TAGACTTTGG GAGCCCAGTC 
35451 CCTTCCTATT TTCCAAATCT AAATCTACAG CCCTAGATGG TACAGAGATC 
35501 TTTGAGTTTT TAAGATATGA TTTTTTGAAA AACATCTCAT TAAATACTGG 
35551 CAGAACCTTT TCATCTTGTT GAGTTTTTTA ATGTACTGTA ACCAAAAAAG 
35601 TAGAATATTT TATCAAACTG TTTAATCTTC AATTGAAATA ATTCTAGTAC 
35651 ATTTTAATGT TCGCATTAAA ATATTGTCCT TGCATTGGAC GTAGATATCC 
35701 CAAAAGTGGA ATACTTCAGA TTGTCGTAGT TTCATCTCTG AATAATTGTG 
35751 ATTCCAGTAC TTTATAACAA AAATAGCTAG CATTATTGAT TACTTTCTGT 
35801 GTATCTGGTA CTGTGGCAGA TACTTTACTT GGATTTTAAT ACTTAATTTC 
35851 ACAGTAATTT AGTAATATGG CCCTGTTATC CTCATTTAGT GATTAGTAAA 
35901 CTAGGGCTGA AAACAGCTAA CTAACTTGCC CGAGACTACA TACCTAGTAA 
35951 GTGGTGGAAC GTAGGTTAAA ATTCATTTTT CTTTGACTTC AAAGTCTGTG 
36001 GTCTTACCTA CTTACATTAC TGCCCTTACG ACTATGTGGG TATATATTTG 
36051 TGTGTGTTCA AAACAAACTC AAAACCATCC TGTAGCGTAG CAAGTTAGTG 
36101 GCTAAGATGA AGCTAGAGCA TTTGCCTCCT CAATTCAATT CCATTACTTT 
36151 CTGTTGTACC TTTATATTTT TTGGTAAGAC TTTTACTTAT TCTAAGTTCA 
36201 AAAAATGTAA TTTATTAGAT GTTTGAGAAA TTAAGTTTAC CTAAATTTTA 
36251 ATGTTCATAC TGTAGTGATT AGTTAATGTT TAATACGTTG TTATTCTGTC 
36301 ACCTTAGTGT ATATATAAAT GGCAAGAATT CACGGTTAGT TGAAAGCATT 
36351 AAGGTCCCAT AGTTTTGTGT AGACAAGAGG GGAGAGCGTT GATATTTTTA 
36401 AATTAAATGC TTCTTAGATA CGTATGAAAT GGATTAAAAC ATGTATATGA 
364 51 GTTATAGATA CCTAGGTGTT AGTTTGGTTG TAAATTCAGG ATCAGGACAT 
36501 TCAAATAAAT ATGTTTGCTT TCCTCTTAGT GGAGGAAAAA AAAAAGAAGC 
36551 TAAATTTGCT CCCTTTCCTC CCCAAATAAG CAGAGTCTAC ATTTTAATGC 
36601 CAACAATTTG ATTAAAACAA ATATTTATTT ATTTTTAATT CACCAAACTT 
36651 TTATAAAGTA TTTACTGGTG CCAGGCACTG TTCTAAAGCA CTCTGTATAT 
36701 ATTTACTCAG TCCTTAAGAG CTAAGTAATA TTATCACGTT TCCATTTTAG 
36751 AGAAAACTGA GGCACATATA GGTTAGGTTA TCTACCCATA GCCATACAGC 
36801 TAGTAAGTAG CAGAGCCATG ATTTCAACAC AGCAGCCTGA CTATGGAGTT 
36851 CATGATCTTA ACCATTTACA GCTTAATTTT TATTATTTAT AATTTCTCTT 
36901 CTGGAAATGT AACAATTGAC CATTTGAAGA AATACTTTAG GTAGCTTTGG 
36951 ATATTTGCTG TATTAAAGTA GTGAAAGTAA TACAGACACT TGGCTGGGCG 
37001 CGGTGGCTCA CGCCTATAAT CCCAGCATTT TGGTAGGTTG AGGCAGGCAG 
37051 ATCACCTAAG GTCAGGAATT CGAGACCAGT GTTGCCAACA TGGTGAAACC 
37101 CCGTCTCTAC TAAAAATACA AAAATTAGCC GGGCGTGGTG GCAGGCGCCT 
37151 GTAATCCCCA GCTACTCGGG AGGCTGAGGC AGGAGAATCA CTTGAACCCA 
37201 GGAGGTGGAG GTTGCAGTGA GCTGAGACGA CGCCATTGCA CTCCAGCCTG 
37251 AGAAACAAGA GAGAAACTCT GTCTCAAAAA AAATAAAGGA ATACAGACTC 
37301 TTAGAAAAAT AATTACAAAT AAAACCCTAG TGAAATTATA GGTATAGTTA 
37351 GGTATAGTTG GCTTACAGGT GGGAAGTAGA CCATTACCAA CTGATAGACT 
37401 GGGGAGCTGG AGAGAGGACA CGGAAGAGTG TCCTTGGATT TTTCNNNNNN 
37451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
37501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
37551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
37 601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
37 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NAAAATTGTC TATATTCATT 
37701 GCCTCCTCCT CTTTACACCC TATTCACATT AGTATATCTG GCAAAAATTT 
37751 TTTTTAACTG AATGGTAAAT GCATGACTGA CCTTTCAATT AAAGCCAGGA 
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37 801 GAAAGAAACA AATCTTAATA GAAGAAATGA ATAGTTACCC TTTGCTTAGG 

37851 GAGCAAGGAA ACATGCAAGT TAAATTCAGA AAATCCATTT GGAAAATTCA 

37 901 AGTAACATGA AGAATTTTTA TTTGGTATGT TTGAATTTCT ATGAAATTAT 

37 951 GAAATAAGCC ATATCCTCTT TCTAGGTGCT ATCAGCTGCC TCAGCTGCAG 

38001 GGGTTTCTGT AGCTTTTGGT GCACCAATTG GAGGAGTTCT TTTTAGCCTG 

38051 GAAGAGGTAG GTGAAAAGAA TACAACAATT AAAATTATAT ATAATTACCA 

38101 TTACAAATAT ATTTCACACA TTTCAGTTTT GTAGGTGATG TAATAGGTAG 

38151 AGACTTTGTT TTCAAATTTA TTTTTCTAAA GTTGTTTTCC ACTCATTCTT 

38201 AATAAAAAGT AAATGTTATT CATGCTCCAT ACCTGGAGGA AACTTTTTAA 

38251 AAATTTATTA ATGTATGAAT GTTAGTAATT ATTTAAAATC TAACTTTGTT 

38301 GACATATTTA AAAGTAAGAA GATGTGAATT TGACTTAATA GAGGACATGT 

38351 GAAACAATCT ATTTCCATTG GCTAAATTCT GTATTTTTAG TAGAGATGGA 

38401 ATTTCACCAT GTTGGCCAGG CTGTTTTTTT GTGGGGTTTT TTTGTTTTGT 

38451 TTTGTTTTGT TTTTGTTTTT GAGACGGAGT TTCACTCTTG TTGCCCAGGC 

38501 TGGAGTGCAA TGGCGCGATC TTGGCTCACT GCAACCTCCG CCTCCAGGGT 

38551 TCAAGTGATT CTCCTGCCTC AGCCTCCCAA GTAGTTTTTG TTTAAAAAAT 

38601 TTTAATCAAT TCCTATGTTG AGTTTTAAAG TTTTTCCCAT GTGATTATTT 

38651 CTGATACAGT TAGTGATGTT AAAGAAAATA ATTTTAGTGA CTTCAGTGGA 

38701 TTATTTTGTT TTTGTTTTGT TAATAGGTGT TTAAGACTTT TCTTTTTACA 

38751 TAAAAATGTA ACCAGGAATT TTTTTTTAAT TTTTTTTGAC AAATAATAAT 

38801 TGTTTTTGTT TATGGGGTAT AATGTGATGT GTCTATACAT GTATACATTG 

38851 CGGAATAATC AAATCAGAGT GATTAGCAAA TCCCTCAAAT ATTTATTATG 

38901 TCCTTGTGGT GGTGAGAACA TTTAAAATCC TCTTTTAGCT ATTTTGAAAT 

38951 ATATAATACA TATTATTAAC TGTGGTCATC TTACTGTGCA ATAGAACACC 

39001 AGAACTTATT CCTCCTCTGT AAGTTCATAC CCGTTGACTA ATGTCTCCCC 

39051 TTTCCCTGTT CACCTCCCCA ACCCCTAGCC TCTGGTAACC CCTATTCTAC 

39101 TCTCTACTTC TATGAATTTA ACTCTTTTAG TTCAAGATGT TTTTAAATGT 

39151 ACTTTTTTCT TTTAGTTGTT TGTATTCTTT tTTTTTTTTT AATGTAGAAG 

39201 AGGCAAATTA AATGCATTAT AAGTTAACAG GAGTTGGTGA TGGTACATTT 

39251 ATTTTTAACT ACCATGATTG AATTGAATGT GAAACTCATT TTGAATATAA 

39301 AACAGCACTA GGTATTCTAT TAGTATTTAT TAGACATTTA TGATCAATTG 

39351 ATACTGTCAA TTTGTAATGA TGATCACCAT CTCCAAAAAT AATAATAACA 

39401 TCAATTTTTC TTATTACAGT AAAATCCATT ACATGTAAAT TCTAACTACA 

39451 GCAAAATTTA GAGCTAGGAT ATTTACCATT CAAGTTATAA TATATCAGAA 

39501 ACATCTTATA AAATTATAGC ATTAATTTTT CTTTTCCTTT TCTTTTTTTT 

39551 AGGTTAGCTA TTATTTTCCT CTCAAAACTT TATGGAGATC ATTTTTTGCT 

39601 GCTTTAGTGG CTGCATTTGT TTTGAGGTCC ATCAATCCAT TTGGTAACAG 

39651 CCGTCTGGTC CTTTTTTATG TGGAGTATCA TACACCATGG TACCTTTTTG 

39701 AACTGTTTCC TTTTATTCTT CTAGGGGTAT TTGGAGGGCT TTGGGGAGCC 

39751 TTTTTCATTA GGGCAAATAT TGCCTGGTGT CGTCGACGCA AGTCCACGAA 

39801 ATTTGGAAAG TATCCCGTTC TGGAAGTCAT TATTGTTGCA GCCATTACTG 

39851 CTGTGATAGC CTTCCCTAAT CCATACACTA GGCTAAACAC CAGTGAACTG 

39901 ATCAAAGAGC TTTTTACAGA CTGTGGTCCC CTGGAATCCT CTTCTCTTTG 

39951 TGACTACAGA AATGACATGA ATGCCAGTAA AATTGTCGAT GACATTCCTG 

40001 ATCGTCCAGC AGGCATTGGA GTATATTCAG CTATATGGCA GTTATGCCTG 

40051 GCACTCATAT TTAAAATCAT AATGACAGTA TTCACTTTTG GCATCAAGGT 

40101 AAGTGCTAAT GTGAGGTGAT ATTTGGGTAA TTTTGGCATG TTCAAAACTT 

40151 ATATGTGGAA TGAGAGAGGT TGTTGTTTCA TAAATGACTG AAAAAAGTAC 

40201 TTATCTTTTG AGTTTAATTT TAAGTAATGA AAAAGATAAT TCCTTAGCAT 

40251 ATATTGTTGA CCATGTTATC TGTTGCTATT TAACAAATTA CCCCCCAAAA 

40301 CTTAGCAGCT TAAGGTAACT ACTTATTTTG TTCTTGATAT TGAGTCAACG 

40351 ACTTGGGAAG GGCTCAACTG GGCAATTTTT GCTTGTGGTC TTTCATATAG 

40401 TTGTTATTAG ACATGGCGAG GGCTAATCAT CTCAAAGCTT CTTTTTTTCG 

40451 TTTCCTTTTT AAAAAACTGT TTTTGTGGAT ACACAGTAGC TATATATAGT 

40501 TTTGGGGTAT ATGAAGTATT TTGATAGAGG CATGGAGTGC ATAATAATCT 

40551 CAGGGTAAAT GGAGTATCCA TCACCTCAAG CATTTATCCC TTGTGTTACA 

40601 AACAATCCAA TTACACTCTT AATTATTTTT AAGTGTACAA TTAAATTATT 

40651 GAATATAGTT CAAAGACTTC TTCATTCATG ACTAGCACCT AGGCTAAAAA 

4 0701 AATTCAGACA CCTGGGCTCC TGGGATCAAT CACGCATACT GTGTCTCTTG 

40751 TGCTCACTCC CGCTGTCTCT CTCTCTTTCT CTCGCTTCCT TTTTCCTCTC 

40801 TCTCTGTGGT TTTCTAGGGT GGTGGCCTCA GGGAATTGGA TTTCTTATAT 

40851 TATAGCTCAG GATTCCCAAG AGGGCTGTTT TTAATGTAGC CAAAGAAGTC 

40901 TTGCAGCGTG ACTTGTTTTA TTCTATTCAT TGAGGTAGTC ACAGAGGCCC 
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AGAGGAGGGA CATACACTTG 
TGTTTTAAAA CCACTTTTAT 
CACAACTTAA TGGCTTAAAA 
CAAATCAAAA GTCTGAAACG 
TAGGGCATTC TGGAGGCTGT 
GCTATTAAAA GCTGCCAGCA 
TTCAAAGCCA GCAGTAGCTG 
ACCCAAACTC TGCTAAATCT 
ACTTTAGGCC CACGCAGATA 
AGTTGCTTCG AAACTTTCTT 
ACGTAATGTA ATCACAGGTT 
AGCCATTATT CTGCCTCATA 
TGCAGACTCA GTTAAATTAC 
AAGAAATTAA GGCAGCTGGG 
ATTTATAAAA TAACAAAATT 
TTTCTTATGT GACTATCAGT 
TTCTTGTTCT TTAAATTACT 
AGTGATGATG TGCTTTTACA 
TTTCTACAAA ATTAGAGTGG 
CTTGTCCTAA GTCTGTAGTC 
AGCCTGGCCA GGCGCAGTGG 
GCCGAGGCGG GCAGATCACA 
TGCGGTGAAA CCCCGTCTTT 
TGGCAGGTGT CTGTAATCCC 
ACTTAAACCC AGGAGGTGGA 
CACTCCAGCC CAGGCAACAG 
AGAAGAAAAT AATAATAGTA 
TCTTAGGGAA TCAAGTTAAA 
AGTAAGTGTT GGGTAAATCC 
AAGCTGGAGG CAATTGGCAG 
TCTGGCTGAA GGATCATACC 
TTTTCTATTG TTTATCCTTT 
CCATCATTTT GTGTTTGGCT 
TTCAGTAGTT TGTTTGTTCG 
TTTTTGCCCA GACTAGAGGG 
CCACCTCCCA GGTTCAAGCG 
GGATTACAGG TGTGGGCCAT 
GAGATGTGAT TTTGCCATGT 
AGGTGATCCT TTGGGAGGCC 
GTGAGCCACT GAACCTGGCC 
ATGGTGCTTC TTATCCCTGT 
TATTTTTCTC TTTTTAATTG 
TAGGTTCAGT TCAGAGCACC 
TCACATAAAA GTCTTAGTTT 
ATGCTGTAGT CTTATGTGTA 
ACTTTAATTT AAAAATACTT 
GTCTTCAGTG AATTGTAATC 
ATTGGTGGTT GCTAGAGGTA 
TAACAGTGAA ATTTGCCGCA 
CTCTGTAGCA TGTGATGCTG 
CTTTTCAAAA TTAGAGTCAT 
TAAGTTTAAG GAAAATTCAA 
CATCTTTACC AGGACTGGAT 
TCCATAAGAA GTAACTCCTT 
GCAATTCAGT CACATCTTTA 
ATTTCCACCA CTCTGTAGTT 
GAGTCATTCA TGAGAGTTGG 
TGATATTTTG ACCTCCTCCC 
GAATGGTGAC TACTTTTTGA 
CAGAGAAGTT GTTATCAGTG 
TACAGATCTT CAGCAACCTC 
ACGGAGGGAC ATAAGGCAGA 
AGTGAAAGTT TATTATTAAA 



CTGGGACAAG TGTAAGAGAA 
TAGTTTCCTA TTGCTGCTGT 
GCCACACAAA TTTAATATCT 
GATCTCACTG TGCTAAAATT 
AGGAGAGAGT CTTGTTTTTT 
TTCCTTGGCT CCTGGCTGTC 
GTCAAGTCTT TCTCTTGTCT 
CCCTTCCACA TTTGAAAAAC 
AATCAGAAAA TAATCTCCTT 
TCTGCCACCT TGATTCCTCC 
CTGGGAATTA AGTTATGGAC 
CCAGTATAGG GTATTAGCTT 
TAGATCTATA AATACATGCC 
TCTTATGCCC TGGGACATTG 
TGTTGATTAA TGGTCTATCA 
GATATATATG GGGAAGCACA 
ACCCCTGTAC TTCATGTAAT 
GATGTAAATT AATGTGGAAT 
TTTTAGTTTT TGAAATAAGG 
CACTGAGTAT CTAGAGTTAA 
CTCACACCTG TAATCCCAGC 
ATGTCAGGAG ATCGAGACCA 
ACTAAAAATA CAAAAATTAG 
GGCTACTCGA GAGGCTGAGG 
GGTTGCAATG AGCCAAGATC 
GGCAAGACAC TGTCTCAAAA 
ATAGAAAAGC CTAAACATTT 
AGAGCTGTTA AAGCTCTTTT 
CAACTTTCTC ACAGTCAGTT 
GCCTTTGTTA AGTCCCACCT 
TGGCAAGAGA GTGTAAAACA 
TAATGATCCT AAGAGACTCA 
CATTTCATAT TCAGAGGAGT 
TTTGTTTGTT TTTTGAGACA 
CAGTGTTGCA GTCTTGGCTC 
ATTCTCCTGC CTCAGCCTCC 
CACACCCGGC TAATTTTTGT 
TGGCCAGGCT GGTCTGGAAC 
TTGGCCTCCC AGAGTGCTAG 
TCTTTCAGTA GTCTTTAAAT 
TTATTATCCT TATTAAATTT 
ATTCATATAA ATAGACTTAC 
ACAATAAAGT GAATATCATA 
CTTAGTGCAT ATAAAAGTTC 
CAATAGCATT ATGTCTTTTA 
GATTGCTAAA AAATGCTAAT 
TGTTTTGCTT CTGTAGGGTC 
GGACTGGCTG TAGCAATTCT 
TTGATTGACA CTGCCTTTCA 
TTTGATACCA TTTTACCTAC 
CCTCTCAAAC CCTGCTACTG 
AATCTTTTGT CCTTTTAACA 
TCTACCTCAA GAAACCACTT 
ATACATTCAA GTTTTTTAAA 
GGCTACGCTT ATCATTCTAG 
ACTTCTTCAA CTGAAGTCTT 
AATCAACTTC TTCCAAACTC 
ATGAAACGTG AATGTTCTGG 
ACATTTTCAA TTTATTTTGC 
GTGGGTTTCC AAGTTGTCAG 
AACTCTTGCC TTCTCAGAGG 
AAAAGAGACT GAGGCAAGTT 
AAGCTTTAGA GTGGGAATGA 
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4 4101 AAAGAAATTA AAATACACTT GAAAGAGGGC CAAGTGGGCA TCTTGGAAGA 
4 4151 CAAGTGCCCC ATTTGACCTT GGACTTAGGG TTTTATATGT TGGCATACTT 
4 4201 CTGGCATCTT GCATCCCTAT TCCATTGATT CTTCTTTTGG GGTGAGTTGC 
4 4251 CCACATGCTC AGTGGCCTGC TAGCACTTGG GAGGGGAGTG TGCACAGTGT 
4 4301 ATTTACTGGA GTTGTATGCA TGCTTACCTG AGGTGTTTGT TGCTTACCAG 
4 4351 CCAAATGTCC CTAGGAGGTC ATATTCATAA ACTCCATGAT TTTGCCTCTA 
4 4401 AATGTGCATG CTTGAGCCCA CTCACCCAAC TCCTGGGATC TTATCGGAAA 
4 4451 GCTGCCGATC GCTAGTTTCA GGTGTTTCTA TCTATTGGAA GATGGCCTTT 
4 4 501 CCCTGATGCT GGCTGCAACC AATTATTACT TTAGAGAGAG AGCATGAGAG 
44551 CTGTCTCACC ATCATCACCT GATGGTTGCC TGACATTCCT GGTGGGGTTG 
44601 GGAGGATGCC TGTCCTGCCC TGCTCATGCC TGACTAGCTA CCTGCTGTAA 
44 651 CAAAAGTACT ATCTATGGTA GCTGTAGCCA TAGGAAATGC ATTTCTTCAG 
44701 TAAAACTTAA AAGTCAAAAT TAGTCTTTAA AACAACATGA ATCTCCTTGT 
44751 ACATCTCCAT CAGAGCTCTT GGAAGACCAG GTGCATTATT AGTGATGAGT 
44801 AATGTTTTAA AAGGAATCTT TTTGTCTGAG CAGTAGGTCT CAACAGTGGG 
44851 CTTAAAATAG TTAGTAAACC ATGCTGTAAA GAGATATGCT GTTATCCAGG 
44 901 CTTTGTTATT CCATTTATAG AGCACAGAGA GAGTAGATTG GCATAATTTA 
4 4 951 AGGATTACTT AAAAAAAAAG TCTTTGATTA CTCTCAAAAA AAAGTCACGT 
45001 CTCTCACTTT ATATCAACAG CTAAAAATGG CCAGGTATTG TGGCTCACGC 
45051 CTGTAATCTC CATGCTTTGG GAGGCCAAGG CAGAAGGATC ACTTGAGGTC 
45101 AGGAGTTAGA GACTAACCTG GGCAACATAG TAAGACCCAT CTCTACAAAA 
45151 AAAAAAAAAA AAAAAAGAAA GCCAGGTGTG GTGGTGCACG CCTGTAGTCC 
45201 CAGCTACTCA CGAGGCTGAG TCGGCAGGAT CACGCCCAGC CAAGAGACGT 
45251 GACTTCTGCT TTCAGTTGTA CACTTAGAGA CCATTGTAGG GTTCTTAGTT 
45301 GGACTAATTT CAATATCATT GGGTCTCAGG GAATAGGGAA GCCTGAGAAG 
45351 AGGGAGAGAC AGGGGAACAG CCAGTTAGTG GAGCAGTCAG ACCACATACA 
45401 ACACTTATTA AGTTCACTTT CTTCTATGGG CATGGTTCAT GGTGCAGTAA 
4 5451 AACAACTGTA ACAGGAACAT CAAAGATCAT TAATCACAGA GCACTGTAAC 
45501 ATATAATAAT AGTGAAAATT TTCAAAGTAT TGAGAGAATT AGCAAAATAT 
45551 GATACAGAGA CACAAAGTGA CCACATGCTG TTGGAAAAGT AGTGCTGATG 
45601 GACTAGCTTG ATGCAAGGAT GTCATAAACC TCAATTTGTG AAAACTGCAA 
45651 CATGTGTGAA GCACAGTAAC ACAAAGCATA GTAAAACAAG ATATGTCTGT 
45701 ATATCAGTCA AAATATTGGG CAACTCTGAT AAGTTTGTCC ACTTAACATT 
45751 GTACCACTTA AGATGAATAG CATCTACCAT TTCCGTCATT TGTAAATATA 
45801 TAGGAGGACA TAATCACATA ATCTTGAAGT AAAAGACAGT GCTTAAAACT 
45851 GAATCAGTTA AGTTTTATGA AAAATACTTC ATATTGTACT TTTAAAAATA 
45901 TATATTTTTT AATTTCAATA GCTTTTGGGT TACAAGTGGT TTTGGTTACG 
45951 TGGATGAATT CTATAATGGT GAAGTCTAAG ATTTTACTGC AACTGTCACC 
4 6001 CAAGTAGTAT ATATTGTATC CAGCATATTG TCCTTTTTTT TTTCTTTTTT 
4 6051 TTTTTTCATT TCACCATGGA CTAATGAAAA TTTTGTTAGG GACTGACATT 
4 6101 AGGGCACCCT TGAGCTACCT TGAGCTAAAG GAAATAACCC TTGAATTTTT 
4 6151 TTCTGTTTGG CCTAGAGAAT GTGGTTTGTT TTGTAACTGA ATTCATGGGA 
4 6201 TTGTTAAGGT ACAAGATTTT GCTTTAGTTT TATTTGTACT AGGATTTTGC 
4 6251 TATATTAATA CAATGTGAAA AGAATCAAAA GTGTTAGAAA TAAATGCATA 
4 6301 GAATGTAAGT TTCAGGCATG TGAGTAGAGG ATCTCTGCTC CATAAAGAGT 
4 6351 TCTGTTGTTG TTATAGGTTC CATCAGGCTT GTTCATCCCC AGCATGGCCA 
4 6401 TTGGAGCGAT CGCAGGAAGG ATTGTGGGGA TTGCGGTGGA GCAGCTTGCC 
4 6451 TACTATCACC ACGACTGGTT TATCTTTAAG GAGTGGTGTG AGGTCGGGGC 
4 6501 TGATTGCATT ACACCTGGCC TTTATGCCAT GGTTGGTGCT GCTGCATGCT 
4 6551 TAGGTAATAT GGCTGTGTCT GCCTGTGTGT GGATGTTTGC AAGTCTGAGA 
4 6601 GAGCCAAGAG AAAGTGGGAC ACATTCTTGC TTAATTGGTG GGCGGATTGG 
4 6651 TTGAGTAAAG GAGGGTGCCA GGAGGAGATG TTTTAACAGA TAAGAAACAG 
4 6701 TAGTACTATT AGGGTATTAT ACAGTACCGG TTTTCTGTCT TACAACATTT 
4 6751 GTTAATACAA GAATTTAATG GCATTAGCAT ATTGTAATAT AACTTAATAC 
4 6801 ACTATGGCAG AAGCCATCTA AGTACAACAT AAGCTTAATT TGAATCCTGA 
4 6851 CCAAAGATGT CTTTGATTCT TTCATCGTTA AGGATCTTGG CTTACCTATA 
4 6901 ACAACTATAG CATAATACCT AAGATTAGCA TTGCAACAGA GTTTCAGAGT 
4 6951 AGGTTTACTT TGGTTCTGAA ATGATTTATT GTTAGCCTTA GTAAAAGATG 
47001 TATTTACCCA TGCTCCATCA TCTAAGGTAT ATTTGTAACA AAATGAGAAA 
47051 AGGTAACTTC ATTTTAATGA GAAGAAAAGC AAAATACCTA CATTAAGTAC 
47101 TTGAGTCTAT TTAATGTCTG TTAGGGCAGG AAAAAATGGT TATTGCTTTT 
47151 CATATTTAAA ATATCAGCTA CACTCTGGTG ATAATATTAA TGGTTGCCAT 
47201 TTTGACCAGT TTTGTTTAGT GAATAAAAAT TATGTGATTA TTGATCTTTA 
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47251 AAAATGTAAT ATCAATTAAA AGGAAAGGAC AGACTCATTT TCACCAAAGT 

47301 AGCAAGTATT TATTAAATGT CCACTTTCTT TTTAGCATTG TGCTAGATAC 

47351 AGTGCATAAT ACAAAAAGAA CATGGACCCA ATCTCGACTC TAATCAAGTT 

47401 GAGGAGACAA GATGAACACT GAGAATACAA TAGTGAGGAA TACTAACAAA 

47451 TATATACAAG GTTAAAAGAG TCTAAGTATG GTAGGAATAT AGGGGAAGAA 

47501 AGAGCTGAAG TACTTCAGGA AGAGTAGAAC ATGAGGCTTT ATTTAAAAGA 

47551 TTAGCAGAAT TTAAGGAAAA GGTGACTTTG TTGAAGATTA TAATGTGAAG 

47 601 ACAAAGGAAC GAGGATGGGA ATAAATTTTG TATTCATGAG GCTTTGAAGA 

47 651 AATTGACTCT AGAGAGTATA TTTTGGGTAC TTTTGGGAAA TGAAGTTGGA 

47701 TTAGTGAGAA GGAACAGATT ATGAAAAGAC AAGAAACCTG ATTAATGTCA 

47751 GGATGATTTT ATATTTGAAG TTGGTCAGAT TTATGGCAGT CCTGGCTTTG 

47801 CCATTTTTAG TTTGATGACT TTGAGAAAGT TCCTTCTTGA AGTTTTAATT 

47851 TTCTGTATAT AAAAAGTAAT AACACCTGGT GATCTGCTAG GTTTGTTTTG 

47901 AGGATTATAT GAGATAAAAT GCATGCAAAA CTGTTATAAT AGTGCCTGGT 

47951 AAAATAAGTG CCTAGTTTTA AAAACAAGTC TTTGTAAACT GCTTAGGACA 

48001 TGCCTGGTAT AGGGTAGGTA TGTAATACAT AGTAGGTAGG ATCTGTCTCC 

48051 TTGCTATTTT TAGGTAAAAA AACAAAAGGA AGAGCTTCAG CTTAATACAG 

48101 TATGAACTGA CGAGCCCTGG TAGGTTTTTG AGCAAAAGAG CAACACAGTA 

48151 AAAGTAGTAC TTAGGAAAGA TTAACAAGGG AACATGGCTT ATACAGTGGT 

48201 AATGGGGCCT GGAGTCAAGG AGGTAAGATA AAATGGTATT ATAATTAAGG 

48251 AATAGCCAGG CACGATGGCA CATGCATGTA ATGCCAGCTA CTGGAGAGGC 

48301 TGAGGTGGGA GGATCATGGG AGTCCAGGAG TTTGAGACCA GCCTGGGCAA 

48351 CTGAGTGAGA CCCCAAATCC TAAAAAATAC AAAGTAAAAA AGGAATAAAG 

48401 TCATGAGGGC TTGGACTGGA TTGATAACAG TGAGAATACC GAGAAAGGGA 

48451 CCATAGGCAG TGTGAACGCA GCTCACTGCA GCCTCAAACC CCAGCCCAAA 

4 8501 CGAGCCTCCC ACCTCAGCCT CCCAAGTAGC TGGGACCACA GACATACACC 

48551 ACCATGCATG ACTACTTTTT TTAGTTTTTA CTTTTGTAGA GACAGGGTCT 

4 8601 CACTGTATTG CCCAGGCTGG TCTCAAACTC CTTGACTTAA GTGATCTTCC 

4 8651 TGCCTTGGCC TCCCAAAGTG ATTACAGGCA TGAGCCACAG TGCCTGGCCC 

48701 AAATAGTTTT CTGTGAGTGA ATATTACTTG CATCGTTAAT GTAAATCAAA 

48751 GGCATCAAAG TATTTTACTC TTTTTGAAAA AAATTTAGAG GAGAAATTTA 

48801 TTATATTMT ATTCTACCCA TATATGAGTT TAATTTGTAA ATTGTAGCAA 

48851 AGCATGATGT GCTTTACTAA ATTCCTTTAT AATTAGAATA AGCTTTTATA 

4 8901 AGGGTGAAAT TATGTCTTTG CTACAGCACT AAACCAAAAT GGCAAAATTG 

48951 TTTTAGTCGG TAAGCTTTGC TTTTTTAAAA TATGAAATAA ACAGGTTTTT 

4 9001 AAAATGTTAT TTTAATAGTC TTCTCTGTTA TAAACAAAGA AAATTGGTGT 

49051 TTCTCTAGAG CTTATTAAAA GTAGTGATTA TTGTCCTAAA AGAGGAGTAG 

49101 CAGTTTTAGA TGCTAATGCT TTTCCCTGAC TGAGTTCTAT TTGCCATTTA 

4 9151 GTTTTAACTG CCTAGTGCAA AAATTCTAAT AAAATGTAAT GATGAGGATC 

49201 CTGTCCTTCC TGACCAGTGG GTGCTTACTT TTTTCAGGTG GTGTGACAAG 

49251 AATGACTGTC TCCCTGGTGG TTATTGTTTT TGAGCTTACT GGAGGCTTGG 

49301 AATATATTGT TCCCCTTATG GCTGCAGTCA TGACCAGTAA ATGGGTTGGA 

4 9351 GATGCCTTTG GCAGGGAAGG CATTTATGAA GCACACATCC GATTAAATGG 

49401 ATACCCTTTC TTGGATGCAA AAGAAGAATT CACTCATACC ACCCTGGCTG 

49451 CTGACGTTAT GAGACCTCTA AGGAATGATC CTCCCTTAGC TGTCCTGACA 

49501 CAGGACAATA TGACAGTGGA TGATATAGAA AACATGATTA ATGAAACCAG 

4 9551 CTACAATGGA TTTCCTGTCA TAATGTCAAA AGAATCTCAG AGATTAGTGG 

4 9601 GATTTGCCCT CAGAAGAGAC CTGACAATTG CAATAGGTAC CCTTTCAAAA 

4 9651 ATATATATAT GTATATATGA GATGGATTTC' TGGAAGAAAG GAAAGCAATA 

4 9701 AGCAGTAACA TTTAATGGGT CGGATTTGTG GGGGCAAGGG ACATTATTTC 

4 9751 ATGTCCCTTA ACATCTTCTG TTCTTTAAGA AAGGAAGGTA TGCTTCAGTG 

4 9801 GATGATTTTC TGCTATATAT CACAAAATCT GTATTTCAGG TTTGTCTTTT 

4 9851 GATCCGGCAT GTACCAGAAA TTGGAGTCAG ATTATTTTCC CACTCAGATA 

4 9901 AGCCTAGATA AGTTGATCTT GGTTATTCAA AACAGCATGT AATATAAGAC 

4 9951 CTTAGCTAAA TGCATTCAGT CAAATACATT CTTGTATTTA ATAAAGTTGG 

50001 CTTATTGGAA TACAAGTTAT TGAAAATCTC ATCTTCATCA GTCTCTTTCA 

50051 TATTAGAATA ACACTGTTTT GCTTTATCAG TCTTTGGGGT TAGAATTATA 

50101 ATATTAATTT ATAATATCTG ATTTAAAGTG ACAATCACTG AGATTTTTAT 

50151 TTCTGATCAA ATGCCAGGTT GAAAAAGTAT AACGTATCAG TCCTGTTGTG 

50201 TTTTATGCAG ACTTTCCTGA AAATACTGTT TAAAGGTATT AGCCATAGTG 

50251 TATTTCTTGG AGATAAATTA AACTTTCTAT AGTTCTGTTT CTCTAAAATT 

50301 TGTTTTTCTC TTTACCTTAT AGTCCCGCAG TATTGATGAG GAGACCATTA 

50351 AGACTTAATA TTTTTTTGAC ACAATCTTAT ATCTCTTCCT CCAACCCCTA 
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50401 AAAAGTGACT GAGGATAGGT ACATCAAGCC ATTGCTTTGT TACTCCCCAG 
504 51 GTTTTAGTGC CAGACCCTGA ATGGAAGTGT CAAGCCTTTG GCCTGTCTGA 
50501 AAGGTCATTC CTGTGAGCAT ATCATCTCCC TTCCAGCTTA CCTCTGTGGC 
50551 CATTGCAAAA GGATTTAAAA ATAATTTTTG TGCCATTTGA ATGGCACAAG 
50601 ACCAGACAGT GTATGTGGGG GAGTGTTTCT CAAATCAAAC TGGAAACTCT 
50651 TTAATTTGTA AGAACCATTA AGCAGAGAGA GAAAAAAGAA AGGAAAAGAA 
50701 AAAAGATCCT ACAGAGAACA CCCTGTTCAG TTTGGGAACA GGCTACAGCT 
50751 TTGGATTTTT CAAGGCCTAG CATTCCCATC ATTCTAAATT TTACTTAGCT 
50801 AATACAATAG TAGTTGCCAG AGCTGATGAC ATAGTATTTT GTCATGCTTG 
50851 GCTCCGTTCA AGCATTTTAG TTTTTTAGCC ATTACCATGG CTAGACCCAG 
50901 TCAAAAGAAT TTTCATTGTT TAAGATTCCC ATTATCCTAG TTTTTACTAG 
50951 TAGCCAGCCA AAGAAAAGAA AAAGGAGGTC AGAATTTCGG TATTTACATA 
51001 GAAATTTAAG GGGAAAAGGC CAGGCATGTT TTTAAAGTGT GGAAATTAAG 
51051 AACTATTCAT TATCCCACTG ATTGTGTGGA TGTGTTTTTT AAAGTTTTGT 
51101 TACTGTCTTG AGAGAGAGAA TATTGAGATA GGACATAATG TTGGTTTAAG 
51151 GGAATGAGGG TACTTTCTGT AGGTGAGGTG CCAAGCCATG TCATCAGAAA 
51201 TGTTAGTCAC ATGACTTTCT AAGCACACCT TAAATGTTTT ACCGTGTATG 
51251 TTTTTGTAAA GTTTTAAATT TTTAACTGGG AAAAACAGAC CTGTATATTA 
51301 AGTTTTATAT AT AT AT AT AA ATTTAAAATT ACATATATAT GTTTATATAT 
51351 GTAACTTTTA TATGGGAGAG ATATATATTT CTATATCCTC TATAAAAAAA 
51401 CATATCTATA TATGAAAATT ATGTACGTAA ATGTTAATTT ATAATTAATT 
51451 ATATAAATAT TAACATAATT ACATTATATA TATAGAAAAC CTAGTGTACA 
51501 GATCTGTATA TAAATTAAAA ATGTATGTGT TATATATAGT TACATCATAT 
51551 AATACATATA ATTGATATAT ATAATGATAA ATACTTTATT GAAGGATGAA 
51601 AAAATTTCCA TGCTGTCTCA TAAAATAAGA TGGTTGACAT ATGCTAAACT 
51651 AGATAGATTC TCCTGTTTCA TACTAAAGCA GAATGTTGTA AAATATTAAA 
51701 TCCAAATGAG ATGTCTCAGA TTAAGGCCAT TTCAACAGGA ATGCTGAGAC 
517 51 TTTAAAAAAA AAAAAAGTCT GAGGCTGGGC GTGGTGGCTC ATGCCTGTAA 
51801 TCCCAGCACT TTGGGAAGCT GAAGCAGGTG GATCACTTGA GGCCAGGAGT 
51851 TTGAGACCAG CCTGGCCAAT GTGGTGAAAT CCCGCCTCTA CTAAAATACA 
51901 AAAAAAATAC ATGGGTGTGG TGACGCATGC CTATAATTCC AGCTACTTGG 
51951 GAGGCTGAGG CAGGAGAATC ACTTGAACCT GGGAGGTGGA GATTGCAGTA 
52001 AGCCCCACCA CTGCACTCCA GCCTGGGCGA AGAGCAAAAC CCTGTCTCAA 
52051 AAAAAAAAAA AGCCTGAATT ATATCAGCAA ATGAAAACTG TAATGTTGTT 
52101 CTCTGTTTCA GAGGCCCTTG AATGAATAGC ACTAAAAATA TTTTTTAAAA 
52151 AATGAAGAAA ATGAAAATTG TAATGTTCCT TATTTAAAAG GCCCTTGAAT 
52201 GAGTAGCATC AAAAATATTT TTAAATGGGA GGCCAGGGTG GGAGGTTTGT 
52251 TTGGCACCAG GAGATCAAGA CCAGCTTGGG TAACATAGCA AGACCTTTGT 
52301 CTCTACCAAA AAAAAAAAAT TGGGTGTGGT GGTGCCACCT GTATTCCTAG 
52351 CTACTGGGAA CACTGATGCA GGAGGATCCC TGGGACTCTA GAGTCCAGAG 
52401 TGAGACCCTG TCTCTAAAAC AAACAAACAA ACAAAAACTG TATTTATGTA 
52451 AAAGTAATAC TTGTTTTTTA AATTTTATTT ATTTTTAATT GATAAAAATT 
52501 GTATGTATGT TTATGTGATG TATATATTGT GGAATGGTTA AATCAGGCTA 
52551 ATTAACTCAG ATTTTTTGTG TGTGTGGGGA GAATATCTAA AATCCCTCTC 
52601 CTTAGCAGTT TCCAAATGAA ATGAAAGAAT AAAAGTGATT TATTTTTTTG 
52651 AGACAGCATC TCACCCTGTT TCTCAGGCTG GAATGCAGTG GCACGATCTT 
52701 GGCTTACTTG ATCCTCGACT TCCCTGGCAT CCGGTGATCC TCCCACTTCA 
52751 CTCTCCTAAT TAGCTAGGAC TACAGGCATG CGCCACCATG ACTGGCTAAT 
52801 TTTTGTATTT CTTGTATAGG CAAGGTTTTG CCATGTTGCC CAGGCTGGTT 
52851 TCAAGCTCCT GGGCTCAAAC GATCCACCTG CCTCAGCCTC CTGAAGTGCT 
52 901 GGGATTACAA GTGTGAGCCA CCACACCTGG CGAAAAGTGT TATTTTTTTA 
52 951 AATGACAAAT TTAAGTCAAA GAGATTGAAT GTTCACTTCT GGTACTTTGT 
53001 ATATAAGAGA AACATTCCAT TAAATAATTT TTTAAACATT TCTAAAATTA 
53051 CATATTTTGT CATTAAATGT TTAAACAATC AGTATAATTT CATTGATACA 
53101 GTGTTTGTTA TTTTGTCGGT GTTTAAGATT GATAATTGGG GTTAGTTTTA 
53151 ATTCAGAATG TTATTCTATT TAATGTCACA CTTCATGTCT TTTTATTTTG 
53201 TATATCTATT AATGAATTAT TTTAGCTATA GTTATTACTG TTTTAGAGAT 
53251 GAGGTCTTCT ATGTTGCCCA GGGTAGACTT GAACTCCTGG GCTTCAGCAA 
53301 TCCCCTCCTC AACCTCCGGA GCACATGAGA TTAGAGACGT GTGCCACTGT 
53351 ATCTGGCCTG CTGTAGTTAT TTTTAATTCT TTTGTCTTTC AACTTTTATA 
53401 CTAGAGTTAG AAATGATTTA CAAACCCTAT TGCAGTTTTA GAGCGTTATG 
53451 AATTTGACTA TATATTTCTT ATAACAACTT AACTTCAGTT GCTTACAAAA 
53501 ACTACAGAGT TTTACTCCCC CGTCCACATT TTATACTATT GATGTCACAC 
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53551 TTTACATCTT TTTATTTTGT GAATCCATTA ATGATACTTC TGGTAGTTTT 
53601 TACACTCCAC TATTCAGTTG TCAGACACCA TTCAGTTGTT AGATTGTTAT 
53651 GAGCTAAAAG CAACTTAATG GGTATTTTTC AAAAATCATT TATGTCAATT 
53701 GCTAATGGAC TTCTTTTCTA TGCCATGATC ATGCTTTTTT TATTTTTGAG 
53751 ACGGAGTTTC ACTCTTGTTG CCTGGGCTGG AGTGCAATGG CGCGGCCTCA 
53801 GCTCACTGCA ACCTCCGCCT CCTGGGTTCA AGCGATTCTC CTGCCTCAGC 
53851 TGGGATTACA GGCATGTGCC ACCGTGCCGG CTAATTTTGT ATTTTTAGTA 
53901 GAGACAGGGT TTCACCATGT TGGCCAGGCT GGTCTCGAAC TCCTGACCTC 
53951 AGTTGATCTG CCCACCTTGG CCTCCCAAAG TGCTGGGATT ACAGACGTGA 
54001 GCCACTGCGC CTGGCCTGAT CATGCTTTTA AGGTGGTTGA GTAAGTACTA 
54051 GTTGCTGGGG CTTTACTTAG TGCCCTCCTA CTCAAATGTG TTAGAACATA 
54101 GTTAAGAAGG CTGTAGTGTT CAAAAGGAGT AAAAAGCAGT GCAGTGTTTG 
54151 CAGTAATATC TGCTTCTCAA TTTAGGACTG ATGCTTATTA TGGCTTAAAT 
54201 GTTTTTGTAG TAAAATTTGT ATTCAAAAT^^ TATATTTTTT TTTCTTTTTG 
54251 CGACAGAGTC TTGCTTTGTC ACCCAGGCTG GAGTGTGGTG GTATGATCAT 
54301 GGCTGACTGC AGCCCTGACC TTCCGGGCTC AAGTGATCTT TCCACCTCAG 
54351 CCTCCCAATT ACTTGGGACC ACCAGCATGC TTGGCCGATT ttTTTTTTTT 
54 401 TTTTTTTTTT GTAGAAGCAA GGTTTCCCTA TGTTGCCAAG GCTGGTCTTG 
54 451 AACTTTAGGG CTCATGTGAT ACTCCTGCCT CGGCCTCCCA AAGTGTTAGG 
54501 ATTACAAGCC TGAGCCACCA TGGCCGGCCA AAATATTTTC ACTATAACAA 
54551 ATATCATATC TGTATATACT CAGTTTTAAT ACTAACTCAA AGTAGAAACA 
54 601 TAAAGCTGAA TGACTATTTT ATTTTCAGAT TCTCTCCATT GAGTTTCCTT 
54 651 CTCCGTCTTG TGTGATCTCT GAACTTTTCT CCATCTTTGC CACTTCTTGT 
54701 CTAGCATTTT TTTTTTATCA GCAGTTTCAT TCAGATTTTT TTTTTAGTTC 
54751 TTTCAACGGT GGAGTGGAAG TAGGCAGCAG GACAGAAGAA CTTGAAGCAG 
54801 AGCACACTGG AGAGGAGAAA TTAACAAAGC CTTTATGAAT AAAACAACCC 
54851 CCCAATATCA GTCTGTGTGC ATTATGAGCA TAATTGTACT TTCATCTCAT 
54 901 CTGTAATGTT CATGACTTTT CTAGAAAATT ATACTTTAAC ATGAGAAAAG 
54 951 AAAAAGAACC AGCTAATTCA TAGGGATGGA GGACACAGCA TAGTCAAAGC 
55001 AAGAATGAAA CTCTCTTTAG TGCCACCTCC AGTGCAGAAT AAGTAACATT 
55051 CAGCAGAGGC AGGTTTCATT TGATAATGGA TTCCTATAAT AAACTGCGCT 
55101 CAGAATTTGT GCAGGTTTTA AAATCCCGTA TTCCAAACCC ACTTCCTTAG 
55151 CCCCCAAGTT AGAAAACAGC TTCAGTAAAG AAAATTGTAC GATGATATAA 
55201 CTTTACCAAA AAATAATTTC TTTCCATGAA GATGATATAT TATTGTTGAC 
55251 TTCTAATTCA ATCAAATATA AACAATTGCT AAATGGCTTT TCAGTTGACT 
55301 CCTTTCTTGG TTAAGGAGAA GATAGGAAAA AATGAAGGGA TCAGAAGTCA 
55351 TAGGATACAT TAATTTTTTT TATCTCTGAA TAAACAGGTT GCCTACTTAA 
55401 AAATCTATCA GTTTAAAAGT GTTGGTCTCT TCTCTCTCTT TTCAGAAAGT 
55451 GCCAGGAAAA AACAAGAAGG TATCGTTGGC AGTTCTCGGG TGTGTTTTGC 
55501 ACAGCACACC CCATCTCTTC CAGCAGAAAG TCCTCGGCCA TTGAAGCTTC 
55551 GAAGCATTCT TGACATGAGC CCTTTTACAG TGACAGACCA CACCCCAATG 
55601 GAGATCGTGG TGGATATTTT CCGAAAGCTG GGACTGAGGC AGTGCCTTGT 
55651 AACTCACAAT GGGTAAGTCT GGTACCACAG GAATCAGTTC ACTTGCTAGA 
55701 ATATAGGATC CTTTTTAGTG GAATCTATAT AGTTATTAGG GGAGCATGTG 
55751 AGTCAGCTCC CAGGTGGGAA AGTCTGTCCT ATGGTATAGT CACAAATATA 
55801 GGATCAGTCA ATCAAATTTC ACATTTACTA AGGAATAAGA AAGATGTCAT 
55851 CTGCCTGCTC TTTGCCAAAC AGTGACATTT GTAAATAATA CCTCAAAGTT 
55901 GGAAAAGAGG TGCTGAAAGA TCTCCAGCAT GAAAGCATGT TGAGCTTAGA 
55951 GTGCTTCTTT TCCTAGGGAA GAGTGGACCT AACCTGCATG GAGCACTGCA 
56001 AAAACCTGTT TTATTTTTGT AAATGTTTCA TTTTTAGTAT ATAAATTTCT 
56051 AGTACAATAA TAAGTTTCTA GATATTTTGC TATTTACTCT TTCAGCCAAT 
56101 ATTTGATTTA TCATGTAATG AAGGAAAGAA TATATACTTA AATGAAATTT 
56151 GTAAATGAGC TAAAAATCTC CTTTAACAAA TGCTTTGTTT CCTTTTGTCT 
56201 ACCTTTCTCT ATACACAAAT CTTTTATATT TATATAACTG CTAAGGACAA 
56251 ATAAATACTC ATGTATTTAA AATGTATACA TTGATAATTT ATTTTTCCAC 
56301 CTTTTACACA TGAACTGCCA GTGTTTCTCC ATTGACAGGA ATATAGGAAA 
56351 GAAACAGATG TCACGGGGGT TGTGGAGACC TTAATGCACA GAATTGATTT 
56401 AGCAAATACA CTACTTCGTC ACCACTGCTC TCTTTTCCTG GACCTGGGAT 
56451 GTGTTTCTCC ACACTTCTTT CTTTAGGACC CTTCATTTCC ACTATATATT 
56501 CTTTCTTGTT GTU^CTTAAGA ATGTTGTTTT ATCCGAAGGC AAATACCAAA 
56551 AAACAGAGGG TATTCTTGGA TTATGCATAA ACTGGATGGC TAATCCTGAA 
56601 CAGCGTAAAG CTGGTTGAAA TTCTAAACAG AGAATCATAG CAGTTTTTTG 
56651 TTGTTTTTTT TTTTTAACAT GTTGTAGAAA ACACATTGGT GACAGAATAC 
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56701 ATGACTCCTG TCCAGAGAAA GGAGAGAAAA AGAACAGAAA GGAAGGAAAT 

56751 TTGTTTATTG AACACCTTCA TATTTTCTCA TTTAACTTTG CAGGACCTCT 

56801 GCAAAGTAGG TAGTTATATC CCTACTTTAC AGATGTAGTA ATTAAAGCTC 

56851 AGGAAGCTTT AATAATTTGC CCAAAGTCAT GTGGTGAACA AGTCATGGTT 

56901 CAAGGAATCA GACTGTCTTT CCTACTTTAA AACCCAGCCT CTTGCTACTA 

56951 TTTTGCACTG TAAGTGACTG ATAGAAATCC TCTTTCTTTG TGATTTCTTA 

57001 AACTACTAAA ACATTTTCTT GGCCAATATA TTAGATTGAG TTAAGAATAG 

57051 AAATATGAAA CTAGAGAATT AGATCTATGT TTAGTGTTTT TCACTGCGCT 

57101 AATTAAAATA ACTCTTTAGG AATATGAAGT AAATCATTAA AGAGATAAAG 

57151 CCCTTAAAGG CAGGGAGTTT AGAATTATTA AATTCTAATA ATTTAGATAC 

57201 TGATTGGAGA AGAGATGTAT TCATAAGTTA TTATTGTTAC TATTTGTCTT 

57251 TGTGTAATAT TGTTTGATTA AATGATGGCA CCGACTTCAT TAAGTTTAAA 

57301 AACTCAGTAC TAGTTAAATG GGGCAACTTT TCATAAAGCT TTGCTAGTCC 

57351 TTGAGCCCTT TTATTTGTTA AATGGCTCAA CTGGAACCTA AGCTGAGTTG 

57401 TTACAAACTA TTATTTGCTT CAAGTTGTTT TCTGTTCCTG GCATGGCTTT 

57451 TTCTTTTGTG TACTGACAAA TATAAATGTT ATTCTGTTGA GTTATGGTTA 

57501 ACTATGAACA CAGAACTGTT AGGGATTAAT TTTCATATTT CAGTTTGTTG 

57551 ATTAATTCCC AGGTATTTGG CAGCATAGAT ATTAGAAAGG AAAATATTTA 

57601 AAAGAAAGTG TAAAAATAAC GAAGTGTATA GAGCGAGGGG TGGATAGCTA 

57 651 ATTAAAATTT TGTCTGGTCC TGCCTGTTCA TATGAAAAAA GGGGTTGGAC 

57701 TTTCTTCTAA GGGAATATAT TAAATTGCTT TCATCATATT TTCCTTATTT 

57751 CTGTCTGTCA AGGAAAATAA ATTGATACAT ATATGGGGAG AAAAGAGATC 

57 801 ATTTAGGGAA GTGGCTCATG GGACTTTTTG TTTTGTTTGA AGTGTATTAG 

57851 GAAGTCGGGT GTTTTTTTTC TCACTTAAAT TATTTAAAAC CCAGAAAAGA 

57901 AATGATATCT TCTGGTTTTT AAAGGAGACC ATGAAGTTCT GCATAGCTAT 

57 951 CATTGATGTG TAGTTCATAC TGCATTTTTA GAAGTGGAAA ATAGTTATTT 

58001 GGAGGAAGAT AACAAATCTG GAACCTTAGG TGCAAGGAGA AAAAGAATAG 

58051 ATGAAAGGGA AAGATGTTTG TAAATTATAA AAATTTCAAT TAGCTATTGG 

58101 TTTTCTGCAC TTTATATTTT AACTGCAGAA TTTTTCAAAA TCAGTTAATC 

58151 TTGGTGGAAT TAGCAGGATG TTAATAGGAG TGACTCAGAA AAAAACATTT 

58201 TGTGACTGTC TAAGTTTGGA AAGTATTGGA TTAAATACAA TTGAGGTTTC 

58251 TTTACTATGG AACTCCTCAG AACTTATAAT ATGTTGATAT TCTTTGATTC 

58301 CCAGATGAGG GGATGGGTAA TAGGATACAT GGTTTTCCAG ACTTGTTTGA 

58351 AAATGCAACT ATTTTTGGGT TGCAGGGAAG GATATAGTAG AACTCATGGG 

58401 AACTGGTGTT TCTTGGAACA TGCTTTGGAA ATGCTGGGTT ATGCCCTGTT 

58451 AACTCTTACA TCATTAGTTT TTAGCCCAAA AGGAAACAGC AAATAATGTT 

58501 TTATATGAGC CACATTTTGC GTTGATTTTC CTTCCACTCT GTAAAATTAC 

58551 TAAAGCAGCA CTCTGACTTT ATTATGCTCA AATCGCTCTT CTCCATTAAT 

58601 GTGTGTTTCT CCATCTTTTA GGGTTTTTAC TTTATAAATA CAGAGATTAC 

58651 TGTGTAAAAT TCTAAATTTG CCACTGGGTC GTTATACATT TGTAACCTTC 

58701 CTCACAGTAT ATTTTGTGAT TTGGCAGAGT TTACCAATAT AGATGATACT 

58751 AACTGAAATT AATCATTCTG TATAATTGGA TAGAAAAGCA TGAGTAAGAA 

58801 TTCAATTGGT ATTATATTTA ATTAATTGCC AAGATTTTCA CATTTCCTGA 

58851 CTACAACAAT AAAATCAAAT GAATTGATGG CTTAAAAAAA AGAAATCTCA 

58901 AATGTTTAGT CAATGAAGAA CATCTATTGA ATGAGTGAAT GTTCATTATA 

58951 TATAGTGCAT TTTCTGAGCT TTTTTGGAGG GGGAAGTTGC TCCCATGCTC 

59001 TGAGAACTTT TAAGGATCGA TACATTATTT TTAACATAAT AATGAGAAAA 

59051 CATGAGCAGA GAACCCATTT CTGTCATTCC CATTCTCTAT CCTCCTGCTC 

59101 CCCCACCTCC CACCCCAGCC ATCAAGCTAA GTAACTATTT TACACCTGGA 

59151 CGTAGCTATA GGAACAGGCT ACTTTGAAGT CTCCTAGTGA CATCCTTCAA 

59201 GTCTGAATGT TCAAAGGCAG TTTAACAGGG AGGTTGACTT AATGAGATCA 

59251 TCAAGGAAAT GTCCAGTCAT CCTGAAGGGT ATTTTGGATG GGCTTCCAGA 

59301 ATTTAAAGAT TAAAGTTTTT TTAAGGTTTT TTTATTTTCA CTGTTTATAT 

59351 TGCCACATTA ATTTCCATTA TAAAACCAGT AACCATAGTT TTGTTTTAAT 

59401 TAGCAATCTA ATTATTTTCA TGTATCCTCA TTATGAGAAT TTATGTCCAT 

59451 CACTTTGCTT GATGTGATAA CAGTGACATG CTAAATGAGA AACAATTGTT 

59501 ATTTAGAAAA AAATGCACAA AGTGAAAGTC CTTTTAATCC CTAATCATAA 

59551 ATACATTTTA TTAGCTTACT TTAAGAAGTG GCAGTCACAG CTCCTGAACA 

59601 TTAGGGAGTG TTTCTTTTGG TCAGCATTAT TTATTTAGTG CACATTGCCT 

59651 TTAATTTTAA TTTGAAATTA TAGTAAAATC CACGGGAGTT TTTAAGTCTC 

59701 CTCACAGCCT TTTGCTACCT TTTCACCAAG GTAGATCCAG ATGATAACTG 

59751 CTGTGTTGTG ACATCATAGA AATTAGAAAA ATATTTTCCT CTGAGGAAAG 

59801 AACATTGTAA ATGAAACTCT ACATATCAGA GGTCTATAGC TATGTATCAA 
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TATTAAGTTT 
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AGATTAGTAT 
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ACCTGGAACT 
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CATTCTTATG 
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CTGTTTCAGG 
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GGTGAAACCC 




61701 


TGTAATCCCA 




61751 


GGACCCAGGA 




61801 


CAGCCTGGGC 
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AAATGAAAAA 




61901 


TCTTCATGTG 




61951 


TTCCTAGTCA 
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TTAATCAAAA 




62051 


GTGGAAAGGC 




62101 


GGATGGTTGC 




62151 


AAATACAAAA 




62201 


ATTTAGCTAT 




62251 


GAGTCTTCTG 




62301 


AAGATATCCT 




62351 


ATAATGTTCA 




62401 


GAAGTTTATT 




62451 


CTTTTTTTTC 




62501 


TTTGCAACAT 




62551 


GGAGGGAAAG 




62601 


AAAGCAGCGT 
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GATGTGCCTG 
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CCTCGAGCAC 
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CCCTATTTCT 




62801 


GGTCTTGCCC 




62851 


TATAGTTAAG 




62901 


TCTCTCTCTC 
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TAAATCGGTG 



CTTTTGTACT TTGCTTTGTA 
TATTTTTACT TTAAAAAGAA 
TGTGTCACTT TTGAAAGTCA 
TTAAATCATT TTGTTTTTAT 
GTTCATGTTT TTATTTTTTC 
CTTTTAATTT AATTTCTCAC 
ATTGTCTTGG GGATCATCAC 
ACTAAAGCAG CACGTCGAAC 
ATTAGACACC TTAGAAGTCA 
CTGTCTTTCC CAGATATCTG 
TTACATTTGT ATCTGATAAA 
TAAAATCCCA TTTATAGAAA 
CATTCATTAC AATTTCAGAA 
ATATTATTGG ACATTCTTAA 
GGATACACCG GTCTCAGAAG 
TCCCCGCAAG CCCTTTTTTT 
GTCTTATGGT AAATCTAAGC 
TAGAACTGCT TACAGTTAAA 
TGTAATGTGT ATATTATTTG 
ACCCTGCATA TCTTCAAAGT 
AATTGTACCT AGATTATGTG 
GCTATAAGCA GCTTTTAGAG 
TCCTAGTGGT TTCATGACTA 
AAAAATGTCT GGCTTTAGCA 
TTGTCATACC TGTATGAAAC 
TTATGTATTC CTGTGTGTAT 
TTTTTTAAAA AACTACTAAC 
TGCTTTCTGC TTCACCTATA 
CACCTATCTT TTAATGTAAG 
TGTATCTTAA CTTTCCTGGC 
TGTCCTAGAA ACTTGGATTC 
TTTGGGATTG ACGTTCTGTG 
AGTATGTGCT TTCATGAAGA 
TCTTTGAAAA ATTGGTTAAT 
CAGTGGCTCA CGCCTGTAAT 
GATCACAAGG TCAGGAGTTG 
TGTCTCTACT GAAAATAATT 
GCTACTTGGG AAGCTGAGGC 
GGCGGAGGTT GCAGTGAGCT 
TACAGAGCGA GACTCTGTCT 
GAGAAAATAT TGAGAGGATT 
ATGGAAATCA ATTTTCCTTC 
TACATCCATC CAGTTTTTGT 
TAGCTCTGCT TCATTGAGGC 
TGTCATACTA TTAGTGACCA 
TGGATAATTT TAATATCTTT 
TCAGGCTGCT TGCTTTGCTC 
AGATTTAGCT TCTCCTTTTA 
TTTATTCCTT TCAGGCGCCT 
CCGGCATATG GCCCAGACGG 
ACTGAATCTC ACAGATGAGG 
TGTTGAATAG CACAACTCTT 
CTCCTTTACA AAAAAAGAAA 
GGTTTGCAAA TAATGCTGGT 
GAGAGAGAAG GAAAGGAGTG 
ATCAACTCCT ATTGTTCTGC 
ATAGTGCAGG CTTGCGCCTC 
CTGGCCTGTT GCTCCAACAT 
AGAGGGATTA CTTTGAATTG 
TTTTTTTTAA TCAAAACTGT 
CATTACCTTT CTACATTCCA 
TCTCTCTCTC TGTCTCTACT 
TATCCTTTTG AAGCAGTCCT 



GTCATCTTCA TTCCAAACTT 
AAATAACCCA CCAATATTGA 
GTAGAATTTA TGCAAAAGGA 
TTTCTAAAGT TCATGAGACT 
TCTCATTCTT TATCATTATG 
ACAGTTATTA GCATAATAAT 
AAAGAAGAAC ATATTAGAGC 
CCTTGGTGAT TAGATATATC 
GGAAGCATGA AACTTGTGAA 
CTGAACAAAA ATATCCTACT 
ATGTGTCTGT AAGATAAATT 
GTAAGCAAAA GTTAACATCT 
CTGTAAACAG TTTGGTAGTG 
AGTGAATATG GCAAATCTGT 
ACACCTGACT GGTTAAAAAT 
TTTTTTTAAA TGTTTCCCGA 
TCCTAAAGGA TTTTAAAGGA 
TGGATTTTTT AATGGGCACA 
TGATCATAGC ATTAGTTCTT 
CACAGTGTGT GTCCTGCCAT 
TGTGCCCCTT TTGTATGATG 
TCAAATGCAT TCATTTTAAC 
CAAATTTGAA TTATCTTACT 
ATTAATGCCC GAAATTATTT 
CTGTCCCAGT TTGCTTAAGT 
GCTAATATTT CACAAGTGTT 
CAGAATATTA TCGTAGCTAC 
ATAATCTTTT AGGACTGCCT 
CATTAACAAC TAAGACTTTC 
CTAAATCAAA AAAAGGAAAA 
TTTTATAGAT TTGTTCTTGG 
CTGACCATTT TATATGCATT 
TTCTGATACA AGTGGGCAAT 
TTTGGTTAAA AAAGGGAAAG 
CCCCAGCACT TTGGGAGGCC 
AAGCCCATTC TGGCCAACAT 
GGGGCATGGT GGCACATGCC 
AGGAGAATTG CTTGAACCGG 
GAGATCGCGC CACTGCACTC 
CAAAAAATAA ATAAATAAAT 
TGGTCATCAT TTTACTGCTC 
TCAAATGGGA TCAGTATCAT 
TACTTTTTTG TTGGCATACA 
ATGCAGTCCT CAGACTCTCG 
TAGTAACTTT TTATACCAAA 
ACCAATAAAG TACTTTTTGG 
TATTCCTGTC AACAAAAAGG 
TTTTCCCTTT TATTTCATAG 
CCTTGGCATT ATAACAAAAA 
CAAACCAAGA CCCCGCTTCA 
AGAGAGAAGA AACGGAAGAG 
TAACCTGAGG GAGTCATCTA 
GGAAATATAA AAGCCGGGTT 
GGAATGGAGG AGTTGTTTGG 
AGGTATTTCC CGTCTAACAG 
ACTGGATGCA TTCAGCTGAG 
AACAGAGATG ACAGCAGAGT 
TGCAAAGACA CATTATCAGT 
AGCCATCTAT AAAACTGCAA 
TCTGTTTAAT TCATGAATTG 
GAAGAGCCTT TATTTCTCTC 
GAGCTGTAAC AAAGCCTCTT 
TTCTCATATT GAGATGTACT 



FIGURE 3, page 



20 of 27 



63001 GTGATTTTAC TGAGGTTTCA TCACAAGAAG GGAGTGTTTC TTGTGCCATT 
63051 AACCATGTAG TTTGTACCAT CACTAAATGC TTGGAACAGT ACACATGCAC 
63101 CACAACAAAG GCTCATCAAA CAGGTAAAGT CTCGAAGGAA GCGAGAACGA 
63151 AATCTCTCAT TGTGTGCCGT GTGGCTCAAA ACCGAAAACA ATGAAGCTTG 
63201 GTTTTAAAGG ATAAAGTTTT CTTTTTTGTT TTCCTCTCAG ACTTTATGGA 
63251 TAATGTGACC GGGTCTTATG CAAATTTTCT ATTTCTAAAA CTACTACTAT 
63301 GATATACAAG TGCTGTTGAG CATAATTAAA TAAAATGCTG CTGCTTTGAC 
63351 AGTAAAGAGA AGGAAGTATT CTGATTAGCT GTATCTGGTA TTAATTGCAT 
63401 GTTAAAACAC TGGAATTTTT AAAATTGAAA TTAGATCAGT CATTCTTTTC 
63451 TTTTCTCAAG ATATCTCATG GCTGACACTG AAGAAGAAAT GTAATTCATA 
63501 ACTTGCACTA AATGTATATT TTTTTTCTTA AAAATTTACC ATTCTTATTT 
63551 ATATTTTTAT GGATTAAAAT TTATAAAATA CAGATCAGTT AATATTGCAC 
63601 TTAAGTAATT TTACCTTTTT AATGTGATTT TTATAGAATA ATTCAGACTT 
63651 ACAAATACAG AGATATGAAC AAAGTTTACA GTGGGAACAA AGGTTTAAAA 
63701 AAAGGTTGTG GTTCTCTCTC TGTGATCCAG TGTGCACATA AACCTTTCTC 
63751 TGATCTTTCA CTGCCATCCT CTGGATTATG TCTTCTGACC TGTCCATTTT 
63801 GACCCATTAA CTGGAAAGTT GAAAAACTAC ATTAACTGGA AAGTTGAAAA 
63851 ACTACATTAC TTTGGAGAAT AAAACCGAAA GTTCGTGTAT ACCTTCTTAA 
63901 AAAAAAAATC AAACCAAAAA TGTGAAAACA ATAGAATTGC AAAGATAGCA 
63951 GTTAAAATTT TAATCTGAAA ATAACCTTTG AATCTCGGGC TAGGTTATGT 
64001 CCATATTTGA AGTGGTCAGT GATGGTTTGA ACATTTTTTG CAGGATGAGT 
64051 TAAAATGCAC TGGATTATAT TTGGGATTTT TGTTTTTGGA ATTGTCTGTT 
64101 TTAATCACAG CCTTAATTCA CAATTGGCAA AGGCAGTTTA CTCAAAGGAC 
64151 TGGGCTAAAT ATTCTGTAAT TATGCATTTT TGATAGGAAA ATGAAATTTT 
64201 TGCAAACAGA CATTTTCTTT TTTTTTGGCT GGAGTGCAGT GGGGCATGGT 
64251 CTTGGCTCAC TGCAGCGTTG ACCACCTGGG CTCAAGTGAT ACTCCCGCCT 
64301 CAGCCACCCA AGTAGCTGGC ACTACGGGCA CACGCCACCA TGCCCAGCTA 
64351 ATTTTTTTGT ATTTTTAGTA GAGATGGGGT TTTGCCATGC TGCCCAGGCT 
64401 GGTCTCAACT CCTCAGCTCA AGCAATCTGC CTGCGTGAGC CTCCCAAAGT 
64451 GGTGGAATTA CAGGCGTGGG CCACTGCGCC TGGCCCAGAC AGACATTTTC 
64501 TGAAACACAA CTGGCAATGA GCTGTTTTTA CATTTTGAAA GTGATTCTTC 
64551 ACTTCCTAGT TCTTAATTAT AGTATACCTA TTAAGATCTG TAAGATCCTG 
64 601 AAGACATAAG ATCATGAAGC CATATAAGAA TGAGGATTGA AAGTTGAGCA 
64 651 AAATTTTCGG GATTTTGGGA AACATTCTTA GCTGTGCTAT CTGCCTAAAA 
64701 TTATTCCTTA TTACTTCTCT CCTTTGACAG ACTTCAAGTT TTCTTCATAG 
64751 CCCTTTCAAA GTTTTTTGAG CCATCCAGAG TAAAATCATT TCTAAATGAT 
64801 AGTTCTGTAT ATCTCCAACT CGTCTTAAGT GTATTTGCCT GTGTGCAACG 
64851 TATTGCTAGA CTATGAACTC CTCAGCATGG CTGCTGGATA ACTTAATTGT 
64 901 CCTGAGTTAA TAGCCTTCAA AGGACAAATC GGTTTCTTTG CAGATAGCTT 
64 951 CGTAAAACTT CACATGGAGT TTATTTTATC ATATTTCCCT TTTTTATTTC 
65001 TGCTCCTCCT TTT^TTGCCC ATCTTGCTTC AGAGACTGAC ATTTCAGGGT 
65051 GGATATTAAT TAAAGCATTA ATTTTGTTTT TTGGTATATT TCTATCCCTA 
65101 GTATTTCTAT CTTACTGCTA AAATACAGGA AAAGTGCCGT ATTTTTAATG 
65151 CATTTAGTGG TTTTCTTTGG TGTTATCTGT TCCATTTTTC TTTTTCATAC 
65201 ATTGAAGTGT GTCTCCTTTT CAACCAAAAT AATGAAATAG TGGAGACCAT 
65251 GAAATTGTTG TGCCTGGCTA ATTGGCAAAT TAATTTACCA ATATAATAAG 
65301 TGTAGCGCCT TGTTTGAATA CCCTTTTTGA GAAGGTATGA TGAGAATGGG 
65351 CAAGGGTGT (SEQ ID NO: 3) 
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0RF(5') 
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Context : 



DNA 

Position 
1275 



GCATTTCAGGAGGAGAATCTCCCAGTCTAGAGGAATCCTCTCAGAGGTAGCTATAAAATA 
TTGAACTCTGATCTTCAATAAGCATTGTGCGGTTTTTGTTTTTGTTTTTAATGACAGTTT 
TAAACAAGAAAGTTGCTTTATTTCTGAACTTCATAAAAATTTCTATTAAAGAGACAATTT 
CTGAATTTTATAACAATTTCTAGAACAGTTGAGTACCTCACTTTGAGACACATTTTTGCT 
AAAAGTTAAAAACACAAAACCCTTATGAGATAAAATAGGAAGCTAGTAGAGATAGGAAAG 
[T,C] 

CCTCTGCTTAGTAAACCTCTTTTTTGCGTAGTTTAGACACATACAATAGTAAAGTTACTT 
AGTACGTTGATAGTTTTCTTTCTCCTCAAAAGCTACAATGTCTTACTAGCTAGTTCCTTC 
AAGAAAGGAAACAAGAAGCCGCTGGAGGAGATTGGTGAGTGGGATAAAACACTATTCAAC 
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TCTTCAGTTATTCGGTTTTTAAATCCTCAATGAAAGGCTGCTGTATTATAGAGTATTTTT 
TTTTTATTTTTAATAGACTTAGAACCAAGTTTCTTGAGAAACCTTTGGCATATTGTAGTT 

TGAATTTTATAACAATTTCTAGAACAGTTGAGTACCTCACTTTGAGACACATTTTTGCTA 
AAAGTTAAAAACACAAAACCCTTATGAGATAAAATAGGAAGCTAGTAGAGATAGGAAAGT 
CCTCTGCTTAGTAAACCTCTTTTTTGCGTAGTTTAGACACATACAATAGTAAAGTTACTT 
AGTACGTTGATAGTTTTCTTTCTCCTCAAAAGCTACAATGTCTTACTAGCTAGTTCCTTC 
AAGAAAGGAAACAAGAAGCCGCTGGAGGAGATTGGTGAGTGGGATAAAACACTATTCAAC 
[T,C] 

CTTCAGTTATTCGGTTTTTAAATCCTCAATGAAAGGCTGCTGTATTATAGAGTATTTTTT 
TTTTATTTTTAATAGACTTAGAACCAAGTTTCTTGAGAAACCTTTGGCATATTGTAGTTT 
TTTTATGGCTATGACTCACATGACATTACTGTATAAAACTAGTACATTCTCTCGTAAAAC 
CACACAAACTTACTAGAGTGCTGCTCTCATTTTTCTACATTAGAAATGAAAAAGGGCATT 
GTCTGCATTCAAAATTTCCTTTTTACATCTCTGTATTACTTTTTCCCCTTTATATTTATC 

TCTAGTTGACAAGACTGAGGTAAGGAATTGTTAAGGAAAAGTCAGAATTCCATCCAGATA 
TTTGGCTCATACTTTAATCATGAGGCTAAACTGCTTCTCTCTACACGTATCTTCATAGTA 
ACTTGTGTTTTAAGTCTGGTAGAAGCATAAGAAGTTTAAACACAGACAGAATCCTGTGGA 
AGTTAGTAAATTTCTAGTGAACGATAGAAATGATAGAAATCTCTTCTTCCCCCAAAGTCC 
CAAGAACAGATTAGTCTGCTTTTGACAAGTGTTATCAAAGTAGACTGTTCTCACATACAC 
[G,A] 

GGGGACTCAATAGGGCATTCCTGGTGGATATAATAAAATGAGTAAATGCGATAACAGGAG 
GAAATGCCTAGTGTGTTGCTCTTGGATTAGTTTTGATACAACAAAGGCAGCTTTGTTGTG 
AGTCAGTAGAGAGGGTAGTGTAGAAAGGTGGAAGTTGGAAGAGTGGCAGATCCTAGAGGA 
CTAATGATGGGCTTAAACCACAAAAAGTGTCGCTTTGCCATTGAA 

ATAAAATGAGTAAATGCGATAACAGGAGGAAATGCCTAGTGTGTTGCTCTTGGATTAGTT 
TTGATACAACAAAGGCAGCTTTGTTGTGAGTCAGTAGAGAGGGTAGTGTAGAAAGGTGGA 
AGTTGGAAGAGTGGCAGATCCTAGAGGACTAATGATGGGCTTAAACCACAAAAAGTGTCG 
CTTTGCCATTGAAATAAAAGTTTGGGGTCTTATTTTTTCAATTTTCTCCCTGAAATTATT 
TCTTGACATTCATTAGCTCAGCAGTGTATCTAAATAAAGCTTTTTTGGGTTTCTATTATA 
[A,G] 

TAGAGGTTTGTTCCTTTTTCTTCCCTTTGAAAAGTATCATTTTTTGCACATTATTTGAAA 
ATCCAGGTGTTATATGATATTCTTATTGCCAGAGGGACATTCTGCAGGCTCTTTGTAAAA 
TGATTTTAGGATTCAGATACTTATTATATTTTTATTGGCCCTAATATTTTATCCAACTAG 
AAAATTAAACCTCTTCTTAAAAATTAATCCATCTAAGTGTCTGTAAATTAAAGGAACAAC 
TAAAGATTCTTTATTTGGTGTCAGAAACTCCTTGTTTCTACAACAGTAGTATAAAACAAA 

ACATGTAAACCAACAATGAAATTATTTTAGTGACTTGAGAATCAAAGTGCTAGAGTTTGA 
ATCCCTGTTCTACTACTTGCTAGCGGTGTGACCTTGGGCCTGTTTAACTCTTGACACCTT 
GTTTTCCAAATTTATAAAGTGGAGATAATAATATCTGTCACATTGTGTTGTTGTGAGGAT 
TATATGAACTAATATATGTAATGTCCTGAGAACAATGTCTGGTACACATTAAGTTAATTA 
AAATTAGCTGTTCTTACTGTTATTATTAGACATGAGCTAGATAACAGTGGCCTCTACATG 
[T,G] 

GAAAGATTATTTTAATTCTGATGTAGTTCAGTTTATCTATTTTTTTTATTTTTGTCCCTT 
TTGCATTGATGTCATATCTAAAAAACCTGCCTAACTCAGGATCACAAAAATTTACTCCTG 
TATTTTATAATTTTAGCTCTTTAGATCTAGGATCCATTTTTAGCTAATTTTTATATATGG 
TGTGAGGTAGGGGTACGGTTTCATTCTTTTGCACGTGAATAGCCAGTTGTCCCAGCATCA 
TTTATTCAAAAGACTATTCTTTCCTCACTAGAAAAAATATTTCTTTAAAGAATAATGAAT 

CCAGGCTCCCTTGAACTCCTGGGCTCAGATGATATAGCCTCCTGCCACAGCGTCCTGATT 
AGCTGGGACTACAGGTGTGCACCACTACACGTGGCTTTCCTGATGAAATTTTAAATACCC 
AAATATTTGAGCAGAAATAATAGCTTGTGTTTATTGTTTTTCTACTATCTGTCAAGTATA 
GTATTAAATGTTTTACATAATTTGTCTCCAGTCCACATACAATACTCTAGTAGAAGTGGG 
TAACAAAACCAAGGTACTCAAAGAGGTTAATAAGTAACTTGCGCTGGATCACAGAACTAA 
[C,T] 

GGGAGGCAGGGCTGGAATTTGACTCTAGGTCTTTCTGACCTCAAAGTGCAGTAAAGTCAT 
GGAATTTCTCTACTAGGCCACCTGGAAGAAAAGTGATCTTTTTTCCAGTCTTTTTTGTTA 
CTGTTTTTCAGCCAGGAGATAGTAGAGTTAGGTAGTAGAATAGTAGTCACTGGCATCCGG 
TAGTCAGCCCTCCAAAAAAGTTTTTGATTTTTTTTTTTTTTTTTGTCTTAAACTTGGAAG 
CTACTAACTTTCAGGTCATACTTTCTTATCATCCAAGAGCTGGATATTTAGGTAGCAGAA 

CTCTAGTAGAAGTGGGTAACAAAACCAAGGTACTCAAAGAGGTTAATAAGTAACTTGCGC 



FIGURE 




TGGATCACAGAACTAACGGGAGGCAGGGCTGGAATTTGACTCTAGGTCTTTCTGACCTCA 
AAGTGCAGTAAAGTCATGGAATTTCTCTACTAGGCCACCTGGAAGAAAAGTGATCTTTTT 
TCCAGTCTTTTTTGTTACTGTTTTTCAGCCAGGAGATAGTAGAGTTAGGTAGTAGAATAG 
TAGTCACTGGCATCCGGTAGTCAGCCCTCCAAAAAAGTTTTTGATTTTTTTTTTTTTTTT 
[T,-] 

GTCTTAAACTTGGAAGCTACTAACTTTCAGGTCATACTTTCTTATCATCCAAGAGCTGGA 
TATTTAGGTAGCAGAAACTATGGAATTATCCTAAGTCCTCTTGAAGCTTCAGCTGTTAAA 
ATTAATTGGTTCTGATTAACACTGTGCTCAAGATTTACATTTCTAGGAGCCACAGTTTGA 
TTGGTCTAACTTGGATCTATGTGTTTTCTTTAGCTGGGGAGGAGAAGGTATCTTGATTGA 
TACCTTCACCAGGACTGCATGCAGTGAGGGACAGAAGTTTCCTTAAAATAATTGGGTTCT 

TTTATTTTCTGCTACTATGGCAGAATTGAGTTGTTGCAACTGTGTGGCATCCAAAGCCTA 
AAATATTTACTCTCCTGGCTCTTTGCCAACCCGTTTTAGATTATGAGCACTTTGGCATTA 
TTATGTTTTTGTTTTCTTTCTATAGCACACAGTAAGATGTTCTGCCCACATTGTGCATAA 
TTTATGGGTTTATTCAAGGATTTATGCAAGTGTAGCTGCAAGAAAAAAACCTAGAAGTGA 
ACTTGCTAGGTTGAAGAGCA 



CTGTGTATGTTAAATTTTGTTAGCTTTCGCCTTCCCAAAGGGATTATTCCATTTCATACT 
TAAACTACTAATTTTGTGATAGGACTTCTTTCTCCATAGCTTTGCTAAATTAATGCATTC 
ACACACTTCATCTTTACTAATCTGATAGAGGGAAATGATATTGTGGATTTGATTTGCATT 
TCTTTTTATGTGTTAGCTTGAGCTTATTTTCATATTTAAAAGCCAATTGTATTTCTTTTT 
CTTGAGCTATCTTTTAATGT 

TTTATGCAAGTGTAGCTGCAAGAAAAAAACCTAGAAGTGAACTTGCTAGGTTGAAGAGCA 
TCTGTGTATGTTAAATTTTGTTAGCTTTCGCCTTCCCAAAGGGATTATTCCATTTCATAC 
TTAAACTACTAATTTTGTGATAGGACTTCTTTCTCCATAGCTTTGCTAAATTAATGCATT 
CACACACTTCATCTTTACTAATCTGATAGAGGGAAATGATATTGTGGATTTGATTTGCAT 
TTCTTTTTATGTGTTAGCTTGAGCTTATTTTCATATTTAAAAGCCAATTGTATTTCTTTT 
[T,-] 

CTTGAGCTATCTTTTAATGTCCTTCCTGATACATTTCTGAAGTCTGTGATACTCATATAA 
GATATATGGTGAACATGTGTCAAAGATTTATTTGACTCTAATGAGGGAACCCGCCTGATG 
ACAAGGCTGATTGAGAAGAGGATGTGTGAGATGAAGTGTATATCATCAGTGAAAGAAAGC 
AAATTCTTACAGGGCAAAAACAAAACCACAACTCTAAGGGTTATTGTTTCTACTGGACAG 
AATTCATTTGCATTTTACCAGATAAAAATTACTATTTTCAATTTATCTTTTACAAATCAT 

CAAAGATTTATTTGACTCTAATGAGGGAACCCGCCTGATGACAAGGCTGATTGAGAAGAG 
GATGTGTGAGATGAAGTGTATATCATCAGTGAAAGAAAGCAAATTCTTACAGGGCAAAAA 
CAAAACCACAACTCTAAGGGTTATTGTTTCTACTGGACAGAATTCATTTGCATTTTACCA 
GATAAAAATTACTATTTTCAATTTATCTTTTACAAATCATTTTCTAATTTTACAGAGTCT 
ATTCCCTAATCAGTAGTAAATAGTCTTCAAAATTCTCCGCAGCGTCAGGTGACTATTATG 
[C,A] 

AGGCTAATTGTTGACACTCGGGCTTGACTTTAAGAGAACATGCCATAATCTTTTGGCCTT 
ACTTCCAAGTTTTGGATAATTTTTCTTAACACATTTTTCTCTAATTGCAATGATTTCAAG 
TGATATTATTTCTTTTTTTTAAATTTTTTTACTATTTATTGATCACTCTTGGGTGTTTCT 
CGGAGAGGGGGATTTGGCAGGGTCATAGGACAATAGTGGAGGGAAGGTCAGCAGATAAAC 
ATGTGAACAAAGGTCTCTGGTTTTCCTAGGCAGAGGACCCTGCGGCCTTCCACAGTGTTT 

TTATTGTTTCTACTGGACAGAATTCATTTGCATTTTACCAGATAAAAATTACTATTTTCA 
ATTTATCTTTTACAAATCATTTTCTAATTTTACAGAGTCTATTCCCTAATCAGTAGTAAA 
TAGTCTTCAAAATTCTCCGCAGCGTCAGGTGACTATTATGCAGGCTAATTGTTGACACTC 
GGGCTTGACTTTAAGAGAACATGCCATAATCTTTTGGCCTTACTTCCAAGTTTTGGATAA 
TTTTTCTTAACACATTTTTCTCTAATTGCAATGATTTCAAGTGATATTATTTCTTTTTTT 
[T,A] 

AAATTTTTTTACTATTTATTGATCACTCTTGGGTGTTTCTCGGAGAGGGGGATTTGGCAG 
GGTCATAGGACAATAGTGGAGGGAAGGTCAGCAGATAAACATGTGAACAAAGGTCTCTGG 
TTTTCCTAGGCAGAGGACCCTGCGGCCTTCCACAGTGTTTGTGTCCCTGGGTACTTGAGA 
TTAGGGAGTGGTGATGACTCTTAATGAGCATGCTGCCTTCAAGCATCTGTTTAACAAAGC 
ACATCTTGCACCGCCCTTAATCCCTTTAACCCTGAGTTGACATAGCACATGTTTCAGAGA 

TTTTTTTTTTGGAGGTCGGGGGACTGTCGCCCATTCTGTTGCCCAAACTGGAGTGCAGTG 
GTGCAATCTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCGATTCTTGTACTCAGC 
CTCCTGAGTAGCTGGAATTATAGGTGTGTGCCATCATGCCAAGCTAATTTTTGTATTTTT 
AGTAGAGATGAAGTTTCGCCATGTTGGCGAGGCTAGTCTCAGACTCCTGGCCTCAAGTGA 



[A,G,T] 
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TTGGCTGACCTCAGCCTCCCAAAGTAGAAAATCTTCTTGAAAAATA7\AATTCCAAATCTC 
[A,G] 

AAAGGCCCTATATAATTTTGGTGTTGGAAATTTACTTGTCAATGAAAATGACTATTTACA 
CAAATTATAAGCTTCCATATTAATATATATGTGTGTGAACCTGAAATTCAAATTTTATTA 
TATTGTTTATGAAAGGTACAGCCTCTGAGATTCATCAGATGGTATTTACCTTTAGGGCAT 
ATCTAAAAATAAAATACAGTACATGAAATCCAGTGCTTTAATCCAGTGATTCTTAAACTT 
TTTGCTCTCAGATCCCCTTTAAACTCTTAAAAGATATTGAAGAGCTCCAAGGAGGCTTTG 

GACTCTACCAATGGGATCGGAGCTCTCCAAACCTGCATATTAAAAGGCCTATAAGTTTTG 
GGGGGTCCCTTTGTCCACATGATTATTCTGTAATACATTGTATTTATGGACATGGTATTA 
TTATACACAGATCCTGTCTTTTAAAGAACATTATAATCCACTTAACTGCTAGGACCAGAG 
AATGACCGATAATTCAAACCATATTGTCTTACAGAAGACATATATAAAAGATGGTTATGT 
GTACCAATTGAGGTTCAAATTTGATTCAATTTAAAACAATCTAGGCCAGATTTTATATAG 
[T,A] 

TTGTGGACCCTTTGCACTCAAATCTCAAGGTTCTTATTAAAATGCAGATCTTGGCTGGGC 
ACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGCCCAAGGCAGGTAGATCATTTGA 
GCTCAGAAGTTCAAGACCAGTCTGGCCAACATAGCGAGGCCCAGTCTCATTGAAAGAAAA 
AAAATTTTTTAATAAAAAATAAAAGCAGATCTTGGGTAAAGACATGTAGTCTGGTTTACA 
GGTATTAACAACTGTCTGTAATGTAGTGATTTTGCTCCAGACTTACCTTTTCCATTATTT 

TTACTGTGAAGGCTGATTTTTTTTTTCTCTCACCACTAATTTAACACATGACTAGGCAAA 
TTTTCAGACTATTTAGTTAAACATCAAGAGCCTGGAAGAAGTATCTTGTGACCTAATGTT 
CTTTGACGGGTTAGTTGTTACTTTGCTGTTATGACCCTGAATTTTTTTTTTTTGAGACTG 
AGTCTTGTGCTGTCGCCCAGACTGGAGTGCAGTGGCGCAATCTCAGCTCACTGCAACCTC 
TGCGTCCCAGGCTCAAGCAATTCTTGTGTCTCAGCCTCCTGAGGAGTTGCGATTGCAGGC 
[A,C] 

CCTGTCACCATGCCCTGCTAATTTTTGCATTTTTTTGTTTGTTTTTTTTTTTTAGTAGAG 
ATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTAACCTCAAGTGATCACCCG 
CCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACACGTGGCTATGACCCT 
GATTTTGATTCATTCACTTTTTATAATTACCTTTTGATTAGATAAGTTAATTATTCTTGA 
ATTTGGCCATTTTATGCTTTGAGAAAGTAGTTAATCACAGTGGGTCAACAGTACAAACTT 

ATCCATCACCTCAAGCATTTATCCCTTGTGTTACAAACAATCCAATTACACTCTTAATTA 
TTTTTAAGTGTACAATTAAATTATTGAATATAGTTCAAAGACTTCTTCATTCATGACTAG 
CACCTAGGCTAAAAAAATTCAGACACCTGGGCTCCTGGGATCAATCACGCATACTGTGTC 
TCTTGTGCTCACTCCC 



CTGTCTCTCTCTCTTTCTCTCGCTTCCTTTTTCCTCTCTCTCTGTGGTTTTCTAGGGTGG 
TGGCCTCAGGGAATTGGATTTCTTATATTATAGCTCAGGATTCCCAAGAGGGCTGTTTTT 
AATGTAGCCAAAGAAGTCTTGCAGCGTGACTTGTTTTATTCTATTCATTGAGGTAGTCAC 
AGAGGCCCGACCACAT 

TACAATTAAATTATTGAATATAGTTCAAAGACTTCTTCATTCATGACTAGCACCTAGGCT 
AAAAAAATTCAGACACCTGGGCTCCTGGGATCAATCACGCATACTGTGTCTCTTGTGCTC 
ACTCCCGCTGTCTCTCTCTCTTTCTCTCGCTTCCTTTTTCCTCTCTCTCTGTGGTTTTCT 
AGGGTGGTGGCCTCAGGGAATTGGATTTCTTATATTATAGCTCAGGATTCCCAAGAGGGC 
TGTTTTTAATGTAGCCAAAGAAGTCTTGCAGCGTGACTTGTTTTATTCTATTCATTGAGG 
[T,C] 

AGTCACAGAGGCCCGACCACATTCAGAGGAGGGACATACACTTGCTGGGACAAGTGTAAG 
AGAATTCATGATCATGTTTTAAAACCACTTTTATTAGTTTCCTATTGCTGCTGTAATAAA 
TTACCACAACTTAATGGCTTAAAAGCCACACAAATTTAATATCTTACAGTTCTGCAAATC 
AAAAGTCTGAAACGGATCTCACTGTGCTAAAATTAAGGTGTTCGTAGGGCATTCTGGAGG 
CTGTAGGAGAGAGTCTTGTTTTTTGCCTTTTCTGGCTATTAAAAGCTGCCAGCATTCCTT 

TGTATATCAGTCAAAATATTGGGCAACTCTGATAAGTTTGTCCACTTAACATTGTACCAC 
TTAAGATGAATAGCATCTACCATTTCCGTCATTTGTAAATATATAGGAGGACATAATCAC 
ATAATCTTGAAGTAAAAGACAGTGCTTAAAACTGAATCAGTTAAGTTTTATGAAAAATAC 
TTCATATTGTACTTTTAAAAATATATATTTTTTAATTTCAATAGCTTTTGGGTTACAAGT 
GGTTTTGGTTACGTGGATGAATTCTATAATGGTGAAGTCTAAGATTTTACTGCAACTGTC 
[A,G] 

CCCAAGTAGTATATATTGTATCCAGCATATTGTCCTTTTTTTTTTCTTTTTTTTTTTTCA 
TTTCACCATGGACTMTGAAAATTTTGTTAGGGACTGACATTAGGGCACCCTTGAGCTAC 
CTTGAGCTAAAGGAAATAACCCTTGAATTTTTTTCTGTTTGGCCTAGAGAATGTGGTTTG 



[G,A] 
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TTTTGTAACTGAATTCATGGGATTGTT7VAGGTACAAGATTTTGCTTTAGTTTTATTTGTA 
CTAGGATTTTGCTATATTAATACAATGTGAAAAGAATCAAAAGTGTTAGAAATAAATGCA 



GAAGAGTAGAACATGAGGCTTTATTTAAAAGATTAGCAGAATTTAAGGAAAAGGTGACTT 
TGTTGAAGATTATAATGTGAAGACAAAGGAACGAGGATGGGAATAAATTTTGTATTCATG 
AGGCTTTGAAGAAATTGACTCTAGAGAGTATATTTTGGGTACTTTTGGGAAATGAAGTTG 
GATTAGTGAGAAGGAACAGATTATGAAAAGACAAGAAACCTGATTAATGTCAGGATGATT 
TTATATTTGAAG 



TGGTCAGATTTATGGCAGTCCTGGCTTTGCCATTTTTAGTTTGATGACTTTGAGAAAGTT 
CCTTCTTGAAGTTTTAATTTTCTGTATATAAAAAGTAATAACACCTGGTGATCTGCTAGG 
TTTGTTTTGAGGATTATATGAGATAAAATGCATGCAAAACTGTTATAATAGTGCCTGGTA 
AAATAAGTGCCTAGTTTTAAAAACAAGTCTTTGTAAACTGCTTAGGACATGCCTGGTATA 
GGGTAGGTATGT 

GACTTTGAGAAAGTTCCTTCTTGAAGTTTTAATTTTCTGTATATAAAAAGTAATAACACC 
TGGTGATCTGCTAGGTTTGTTTTGAGGATTATATGAGATAAAATGCATGCAAAACTGTTA 
TAATAGTGCCTGGTAAAATAAGTGCCTAGTTTTAA7\AACAAGTCTTTGTAAACTGCTTAG 
GACATGCCTGGTATAGGGTAGGTATGTAATACATAGTAGGTAGGATCTGTCTCCTTGCTA 
TTTTTAGGTAAAAAAACAAAAGGAAGAGCTTCAGCTTAATACAGTATGAACTGACGAGCC 
EC,T] 

TGGTAGGTTTTTGAGCAAAAGAGCAACACAGTAAAAGTAGTACTTAGGAAAGATTAACAA 
GGGAACATGGCTTATACAGTGGTAATGGGGCCTGGAGTCAAGGAGGTAAGATAAAATGGT 
ATTATAATTAAGGAATAGCCAGGCACGATGGCACATGCATGTAATGCCAGCTACTGGAGA 
GGCTGAGGTGGGAGGATCATGGGAGTCCAGGAGTTTGAGACCAGCCTGGGCAACTGAGTG 
AGACCCCAAATCCTAAAAAATACAAAGTAAAAAAGGAATAAAGTCATGAGGGCTTGGACT 

GCTTTGTCACCCAGGCTGGAGTGTGGTGGTATGATCATGGCTGACTGCAGCCCTGACCTT 
CCGGGCTCAAGTGATCTTTCCACCTCAGCCTCCCAATTACTTGGGACCACCAGCATGCTT 
GGCCGATTTTTTTTTTTTTTTTTTTTTTGTAGAAGCAAGGTTTCCCTATGTTGCCAAGGC 
TGGTCTTGAACTTTAGGGCTCATGTGATACTCCTGCCTCGGCCTCCCAAAGTGTTAGGAT 
TACAAGCCTGAGCCACCATGGCCGGCCAAAATATTTTCACTATAACAAATATCATATCTG 
[T,G] 

ATATACTCAGTTTTAATACTAACTCAAAGTAGAAACATAAAGCTGAATGACTATTTTATT 
TTCAGATTCTCTCCATTGAGTTTCCTTCTCCGTCTTGTGTGATCTCTGAACTTTTCTCCA 
TCTTTGCCACTTCTTGTCTAGCATTTTTTTTTTATCAGCAGTTTCATTCAGATTTTTTTT 
TTAGTTCTTTCAACGGTGGAGTGGAAGTAGGCAGCAGGACAGAAGAACTTGAAGCAGAGC 
ACACTGGAGAGGAGAAATTAACAAAGCCTTTATGAATAAAACAACCCCCCAATATCAGTC 

TGGGTTATGCCCTGTTAACTCTTACATCATTAGTTTTTAGCCCAAAAGGAAACAGCAAAT 
AATGTTTTATATGAGCCACATTTTGCGTTGATTTTCCTTCCACTCTGTAAAATTACTAAA 
GCAGCACTCTGACTTTATTATGCTCAAATCGCTCTTCTCCATTAATGTGTGTTTCTCCAT 
CTTTTAGGGTTTTTACTTTATAAATACAGAGATTACTGTGTAAAATTCTAAATTTGCCAC 
TGGGTCGTTATACATTTGTAACCTTCCTCACAGTATATTTTGTGATTTGGCAGAGTTTAC 
[C,T] 

AATATAGATGATACTAACTGAAATTAATCATTCTGTATAATTGGATAGAAAAGCATGAGT 
AAGAATTCAATTGGTATTATATTTAATTAATTGCCAAGATTTTCACATTTCCTGACTACA 
ACAATAAAATCAAATGAATTGATGGCTTAAAAAAAAGAAATCTCAAATGTTTAGTCAATG 
AAGAACATCTATTGAATGAGTGAATGTTCATTATATATAGTGCATTTTCTGAGCTTTTTT 
GGAGGGGGAAGTTGCTCCCATGCTCTGAGAACTTTTAAGGATCGATACATTATTTTTAAC 

GTTTATATTGCCACATTAATTTCCATTATAAAACCAGTAACCATAGTTTTGTTTTAATTA 
GCAATCTAATTATTTTCATGTATCCTCATTATGAGAATTTATGTCCATCACTTTGCTTGA 
TGTGATAACAGTGACATGCTAAATGAGAAACAATTGTTATTTAGAAAAAAATGCACAAAG 
TGAAAGTCCTTTTAATCCCTAATCATAAATACATTTTATTAGCTTACTTTAAGAAGTGGC 
AGTCACAGCTCCTGAACATTAGGGAGTGTTTCTTTTGGTCAGCATTATTTATTTAGTGCA 
[C,A] 

ATTGCCTTTAATTTTAATTTGAAATTATAGTAAAATCCACGGGAGTTTTTAAGTCTCCTC 
ACAGCCTTTTGCTACCTTTTCACCAAGGTAGATCCAGATGATAACTGCTGTGTTGTGACA 
TCATAGAAATTAGAAAAATATTTTCCTCTGAGGAAAGAACATTGTAAATGAAACTCTACA 
TATCAGAGGTCTATAGCTATGTATCAATATTAAGTTTCTTTTGTACTTTGCTTTGTAGTC 
ATCTTCATTCCAAACTTTCATAATTATTATTTTTACTTTAAAAAGAAAAATAACCCACCA 



[T,C] 




AAAAAAAGGAAAACATTGATAAGTGTCCTAGAAACTTGGATTCTTTTATAGATTTGTTCT 
TGGGGCTCTGATGTTTGGGATTGACGTTCTGTGCTGACCATTTTATATGCATTTTATCTT 
AATAGTATGTGCTTTCATGAAGATTCTGATACAAGTGGGCAATCCTTAAATTATCTTTGA 
AAAATTGGTTAATTTTGGTTAAAAAAGGGAAAGTGGCTGGGTGCAGTGGCTCACGCCTGT 
AATCCCCAGCACTTTGGGAGGCCGGGACGGGTGGATCACAAGGTCAGGAGTTGAAGCCCA 
[G,T] 

TCTGGCCAACATGGTGAAACCCTGTCTCTACTGAAAATAATTGGGGCATGGTGGCACATG 
CCTGTAATCCCAGCTACTTGGGAAGCTGAGGCAGGAGAATTGCTTGAACCGGGGACCCAG 
GAGGCGGAGGTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGGGCTACAGAGC 
GAGACTCTGTCTCAAAAAATAAATAAATAAATAAATGAAAAAGAGAAAATATTGAGAGGA 
TTTGGTCATCATTTTACTGCTCTCTTCATGTGATGGAAATCAATTTTCCTTCTCAAATGG 

GAGATGTACTGTGATTTTACTGAGGTTTCATCACAAGAAGGGAGTGTTTCTTGTGCCATT 
AACCATGTAGTTTGTACCATCACTAAATGCTTGGAACAGTACACATGCACCACMCAAAG 
GCTCATCAAACAGGTAAAGTCTCGAAGGAAGCGAGAACGAAATCTCTCATTGTGTGCCGT 
GTGGCTCAAAACCGAAAACAATGAAGCTTGGTTTTAAAGGATAAAGTTTTCTTTTTTGTT 
TTCCTCTCAGACTTTATGGATAATGTGACCGGGTCTTATGCAAATTTTCTATTTCTAAAA 
[G,C] 

TACTACTATGATATACAAGTGCTGTTGAGCATAATTAAATAAAATGCTGCTGCTTTGACA 
GTAAAGAGAAGGAAGTATTCTGATTAGCTGTATCTGGTATTAATTGCATGTTAAAACACT 
GGAATTTTTAAAATTGAAATTAGATCAGTCATTCTTTTCTTTTCTCAAGATATCTCATGG 
CTGACACTGAAGAAGAAATGTAATTCATAACTTGCACTAAATGTATATTTTTTTTCTTAA 
AAATTTACCATTCTTATTTATATTTTTATGGATTAAAATTTATAAAATACAGATCAGTTA 

TGTGCCGTGTGGCTCAAAACCGAAAACAATGAAGCTTGGTTTTAAAGGATAAAGTTTTCT 
TTTTTGTTTTCCTCTCAGACTTTATGGATAATGTGACCGGGTCTTATGCAAATTTTCTAT 
TTCTAAAACTACTACTATGATATACAAGTGCTGTTGAGCATAATTAAATAAAATGCTGCT 
GCTTTGACAGTAAAGAGAAGGAAGTATTCTGATTAGCTGTATCTGGTATTAATTGCATGT 
TAAAACACTGGAATTTTTAAAATTGAAATTAGATCAGTCATTCTTTTCTTTTCTCAAGAT 
[A,G] 

TCTCATGGCTGACACTGAAGAAGAAATGTAATTCATAACTTGCACTAAATGTATATTTTT 
TTTCTTAAAAATTTACCATTCTTATTTATATTTTTATGGATTAAAATTTATAAAATACAG 
ATCAGTTAATATTGCACTTAAGTAATTTTACCTTTTTAATGTGATTTTTATAGAATAATT 
CAGACTTACAAATACAGAGATATGAACAAAGTTTACAGTGGGAACAAAGGTTTAAAAAAA 
GGTTGTGGTTCTCTCTCTGTGATCCAGTGTGCACATAAACCTTTCTCTGATCTTTCACTG 

TGCTGCTGCTTTGACAGTAAAGAGMGGAAGTATTCTGATTAGCTGTATCTGGTATTAAT 
TGCATGTTAAAACACTGGAATTTTTAAAATTGAAATTAGATCAGTCATTCTTTTCTTTTC 
TCAAGATATCTCATGGCTGACACTGAAGAAGAAATGTAATTCATAACTTGCACTAAATGT 
ATATTTTTTTTCTTAAAAATTTACCATTCTTATTTATATTTTTATGGATTAAAATTTATA 
AAATACAGATCAGTTAATATTGCACTTAAGTAATTTTACCTTTTTAATGTGATTTTTATA 
[G,A] 

AATAATTCAGACTTACAAATACAGAGATATGAACAAAGTTTACAGTGGGAACAAAGGTTT 
AAAAAAAGGTTGTGGTTCTCTCTCTGTGATCCAGTGTGCACATAAACCTTTCTCTGATCT 
TTCACTGCCATCCTCTGGATTATGTCTTCTGACCTGTCCATTTTGACCCATTAACTGGAA 
AGTTGAAAAACTACATTAACTGGAAAGTTGAAAAACTACATTACTTTGGAGAATAAAACC 
GAAAGTTCGTGTATACCTTCTTAAAAAAAAAATCAAACCAAAAATGTGAAAACAATAGAA 

AAAAAAGGTTGTGGTTCTCTCTCTGTGATCCAGTGTGCACATAAACCTTTCTCTGATCTT 
TCACTGCCATCCTCTGGATTATGTCTTCTGACCTGTCCATTTTGACCCATTAACTGGAAA 
GTTGAAAAACTACATTAACTGGAAAGTTGAAAAACTACATTACTTTGGAGAATAAAACCG 
AAAGTTCGTGTATACCTTCTTAAAAAAAAAATCAAACCAAAAATGTGAAAACAATAGAAT 
TGCAAAGATAGCAGTTAAAATTTTAATCTGAAAATAACCTTTGAATCTCGGGCTAGGTTA 
[T,C] 

GTCCATATTTGAAGTGGTCAGTGATGGTTTGAACATTTTTTGCAGGATGAGTTAAAATGC 
ACTGGATTATATTTGGGATTTTTGTTTTTGGAATTGTCTGTTTTAATCACAGCCTTAATT 
CACAATTGGCAAAGGCAGTTTACTCAAAGGACTGGGCTAAATATTCTGTAATTATGCATT 
TTTGATAGGAAAATGAAATTTTTGCAAACAGACATTTTCTTTTTTTTTGGCTGGAGTGCA 
GTGGGGCATGG'rCTTGGCTCACTGCAGCGTTGACCACCTGGGCTCAAGTGATACTCCCGC 
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